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Audience 


This reference manual is for system designers‘and pr 


mmiers who use the 
Alpha 21164 microprocessor. : 


Content 


» Chapter"10 provides information about thermal management 
considerations. 


Chapter 11 provides mechanical data and packaging information, including 
signal pin lists. 


Chapter 12 describes chip and system testability features. 


Xxi 


¢ Appendix A summarizes the Alpha AXP instruction set. 


















e Appendix B summarizes the 21164 specifications. 
e Appendix C lists changes and revisions to this manual. 


¢ Appendix D provides phone numbers for support and lists 
publications with order information. 


e The Glossary lists and defines terms associated with th 2111 
The companion volume to this manual, the Alpha Arc} ; 
Manual, contains the Alpha AXP Architecture informagion Soa 
Terminology and Conventions 
The following sections describe the terminolo Val 
manual. 
Numbering 


All numbers are decimal unless othe 
numbers other than decimal are indi 








uly privileged software (that is, software running in kernel 
can trigger UNDEFINED operations. Unprivileged software cannot 
igeer UNDEFINED operations. However, either privileged or unprivileged 
tware can trigger UNPREDICTABLE results or occurrences. 


xxii 


UNPREDICTABLE results or occurrences do not disrupt the basic operation 
of the processor. The processor continues to execute instructions in it$:normal 


manner. In contrast, UNDEFINED operations can halt the processor © 
it to lose information. 


The terms UNPREDICTABLE and UNDEFINED can be fu 
follows: 


UNPREDICTABLE 


" process, or on a sequence of actions of different processes. 

















Results or occurrences specified as UNPREDIGLA 
moment to moment, implementation to implem 
instruction within implementations. Software ¢an 1 
specified as UNPREDICTABLE. : 


An UNPREDICTABLE result may ac 


current access mode. UNPREDI g 
their previous values. 


exceptions. 


An occurrence specifie s 


of processor temporary registers left behind by some previously running 
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UNDEFINED 
e Operations specified as UNDEFINED may vary from moment to mot 






stopping system operation. 
e UNDEFINED operations may halt the processor or caus 
processor to hang, that i is, reach an unhalted state fr 

transition to a normal state in which the machine; 


Only privileged software (that is, software running 
trigger UNDEFINED operations. 


Data Field Size 











naturally aligned longword. 
Ranges and Extents 







and are inclusive. For exampl 
integers 0, 1, 2, 3, and 4. 


: 'N is stored at a byte address that is not a multiple of 
tiled UNALIGNED. 
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Register Format Notation 








Some registers are followed by a description of each field. The fields 0 
register are labeled with either a name or a mnemonic. The déeseripts 





The “Type” column in the field description includes both th 
field, and an optional initialized value, separated from. 
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Table 1 Register Field Type Notation 


Notation 
RC 


RO 


RW 


Wwoc 


W1C 


WA 
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Description 


A read-to-clear field. The value is written by hardware“and 
unchanged until read. The value may be read by software : 
point, hardware may write a new value into the fie} 









‘to be cleared by 
hardware. Software write BORer not modify the state of 


the bit. 





the bit. 


A write-anythi 
allowed to th 


“The value may be written by software 
ardware. Read operations by software return an 






the type of the unnamed fields in the register. 


Table 2 Register Field Notation 


Notation 
IGN 





Description 


Register bits specified as ignore (IGN) a j 


MBZ 


SBZ 













are UNPREDICTABLE when read if not 


Register bits specified as MBZ (must be: 
software with a non-zero value. If the'px 
value in a field specified as MBZ, an UNDE! 
result. a8 


Register bits specified as RA‘ 








}teturn a zero when read. 


Guld be zero) should be filled by 
values in SBZ fields produce 
produéé extraneous instruction-issue 


Register bits specified as.s 
software with a zero valt 
UNDEFINED results and 
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d, i, multiple instruction issue, 
ae from many operating systems. 







are either load: a 
registers. 






sources aebes it easy to build implementations that issue multiple 
instructions every CPU cycle. 
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The 21164 uses a set of subroutines, called privileged architecture libragy 

code (PALcode), that is specific to a particular Alpha AXP operating system 
implementation and hardware platform. These subroutines provide operatit 
system primitives for context switching, interrupts, exceptions, an: 
management. These subroutines can be invoked by hardware or: ee 
instructions. CALL_PAL instructions use the function field ofthe instruction 
to vector toa specified subroutine. PALcode is written in stand, d machine 

























multi-instruction atomic sequences. 


The Alpha AXP architecture performs byte shifti 
64-bit, register-to-register instructions; it does #0t* 
store instructions. : 


. 1.1.1 Addressing 


Virtual addresses as seen by the 
addresses by the memory man 


Data Type 
Byte 


Word aes ae ate start at an arbitrary a boundary. A 


is it value. A word is supported in ee AXP architecture 


arid by ea arithmetic instructions. 


A quadword is 8 contiguous bytes that start at an arbitrary byte 
boundary. A quadword is supported in Alpha AXP architecture by load 
and store instructions and quadword integer operate instructions. 
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Note 
















Alpha Architecture Reference Manual for details. 


1.1.3 Floating-Point Data Types 
The 21164 recognizes the following floating-point di 
¢ Longword integer format in floating-point uni 
¢ Quadword integer format in floating-poi 
e IEEE floating-point formats 
— § floating 
— T floating 

¢ VAX floating-point formats 
— F floating 
— G_floating 


: alee ies of the memory subsystem further reduce the average CPI. 


The 21164 and associated PALcode implements IEEE single and double 
precision, VAX F floating and G_floating data types, and supports longword 
:2:bit) and quadword (64-bit) integers. Byte (8-bit) and word (16-bit) support 


is provided by byte manipulation instructions. Limited hardware support is 
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Other 21164 features include: 


e A peak instruction execution rate of four times the input cloé! 
= 


64, or 512 8K-byte pages. The size of cok 
: specified by hint bits stored in the entry; 


executing both Digital and IEEE Poa 5 


e An on-chip, 8K-byte virtual inst Teh 
ASN=127). 


e An on-chip, dual-read-ported, 
° An on- as write buffer 


A 128-bit data 
support. 


nterface to support chip and module level testing. 
A 3.3-V power supply. (Direct connection to 5-V logic supported.) 


to Chapter 9 for 21164 dc and ac electrical characteristics. Refer to 


Alpha Architecture Reference Manual for a description of address space 
numbers (ASNs). 
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ign examples 
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2.1 Alpha 21164 Microarchitecture 


The Alpha 21164 Microprocessor is a high-performance implementation ; 
Digital’s Alpha AXP architecture. The following sections provide ap, 1 
of the chip’s architecture and major functional units. 


Figure 2—1 is a block diagram of the 21164. 
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Figure 2-1 Alpha 21164 Microprocessor Block Diagram 
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Address 






The Alpha 21164 microprocessor consists of the following internal sections 
(Figure 2-1): 
















¢ Clock generation logic 
¢ Instruction fetch and decode unit (Ibox), which includes: 


Instruction prefetcher and instruction decoder 
Branch prediction 
Instruction translation buffer 
Interrupt support 


e Integer execution unit (Ebox) 
¢ Floating-point execution unit (Fbox) 


¢ Memory address translation unit (Mbox), whi 


Miss address file (MAF) 
Write buffer 
Deache control 





¢ Cache control and bus interfacé anit: ) with interface to external cache 


e Data cache (Deache) 


¢ Instruction cache Meech 


Register conflict logic 


terrupt and exception logic 
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2.1.1.1. Instruction Decode and Issue 





The Ibox decodes up to four instructions in parallel and checks that € 
required resources are available for each instruction. The Ibox issues‘only 


later instruction and not for an earlier one. 
In other words: 


Each fill occurs when the instruction buffer stage in the Ibox pipeline requires 
a new INT16. The INT16 is written into the Icache and the instruction buffer 





a new INT16, and on availability of data in the refill buffer. 
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Once an Icache miss occurs, the Icache enters fill mode. When the Icach 
is in fill mode, the refill buffer is checked each cycle to see if it contains‘the 
next INT16 required by the instruction buffer. When the required data is* 
available i in the refill buffer, the Icache is checked for a hit while itawaits 














the Icache returned to access mode. 


2.1.1.3 Branch Execution 
When a branch or jump instruction is fetched a i 






and begin fetching the target Perea te 
instruction is issued. 


The Icache records the outcome of | es 
provided for each instruction local 


status is not initialized on Icaé 
that was evicted from the I 







updated accordingly. On branch mispredict, a 
the Ibox restarts execution from the correct PC. 


21164 uses the Icache index hint in the JMP and JSR instructions to 
dict the target PC. The Icache index hint in the instruction’ s displacement 


are oi rmed from the data in the Icache tag store at that index. Later in the 
pipeline, the PC prediction is checked against the actual PC generated by the 
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Ebox. A mismatch causes a PC mispredict trap and restart from thé.correct 
PC. This is similar to branch prediction. 


The RET, JSR_COROUTINE, and HW_REI instructions predict, fhe 1 né 
using the index from the subroutine return stack. The upper. bi 











predictions are checked. 


Changes from PALmode to native mode and vice ver 


2.1.1.4 Instruction Translation Buffer — : 
The Ibox includes a 48-entry, fe a 


ans of a 7-bit ae field in each ITB entry. PALcode, which supports 
yerations to the architecturally defined TBIAP register, does so by using 
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The 21164 provides two optional translation extensions called superpagés 
Access to superpages is enabled using ICSR<SPE> and is allowed only W. le 
executing in privileged mode. 


¢ One superpage maps virtual address bits <39:13> to physic 
<39:13>, on a one-to-one basis, when virtual address bits < 
2. This maps the entire physical address space four time 
quadrant of the virtual address space. 





mode. 











2.1.1.5 Interrupts 
The Ibox exception logic supports th 
¢ Hardware interrupts 


There are seven level-sensiti 
following signals: 







as aah (see Section 5.1.22). 
aps (ASTs) 





enable registers. In addition, AST interrupts are qualified by the current 
‘processor mode. 
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Interrupts are masked by the hardware interrupt priority level (IPLi}.register 
(see Section 5.1.18). In addition, AST interrupts are qualified by the 
processor mode. The serial line interrupt, the internally detected corregtable 
error interrupt, the performance counter interrupts, and irg_Ke8:0> are 
all maskable by bits in the Ibox control and status register: 

Section 5.1.17). All interrupts are disabled when the pro 
PAL code. 
































is executifi 


2.1.2 Integer Execution Unit 


The integer execution unit (Ebox) contains two 6 
pipelines, EO and E1, which include the following 


¢ Two adders 
¢ Two logic boxes 
¢ A barrel shifter 


rexecution 





¢ Byte manipulation logic 
e An integer multiplier 


The Ebox also includes the 40:8 4 er register file (IRF) that 
contains the 32 integer register “bysthe Alpha AXP architecture and 8 


The on-chip, si { 
VAX floatingspiit: mn 


ingepoint multiply pipeline. 


A floating-point add pipeline—The floating-point divide unit is associated 
with the floating-point add pipeline but is not pipelined. 


Gnstrtietions is four cycles. 


Preliminary Edition—September 1994 2-9 


The floating-point register file (FRF) has five read ports and four write sorts. 
Four of the read ports are used by the two pipelines to source operands. :“£t 
remaining read port is used by floating-point stores. Two of the write port 
are used to write results from the two pipelines. The other two write:port 
used to write fills from floating-point loads. 











2.1.4 Memory Address Translation Unit 
The memory address translation unit (Mbox) contains thr 
¢ Data translation buffer (dual ported) 
¢ Miss address file 
¢ Write buffer address file 


The Mbox arbitrates between age see loads tha Rit. in the Deache and 
in that orifyone register is 
loads that conflict with Cbox 














fills for use of these write ports a are forces 
Chox fill can occur. 


entry supports all fe 
translation for up to 5 






as bit virtual address is presented to the DTB. If 
ual address is cached in the DTB, the page frame 


fhe virtual address space. 
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must ensure that virtual addresses be mapped either Maas a singh DTB 
entry or through superpage mapping. Multiple simultaneous mappin 

cause UNDEFINED results. The only exception to this rule is:tt 
page may be mapped twice with identical data in two diffe 
This occurs in eS systems, such as OpenVMS, whic 





once in the primary handler. The PTE. mapping th 
remain constant during accesses to this page to mé 


2.1.4.2 Load instruction and the Miss Address File 


rea i in the miss address file (MAF). 


T :: When a load miss occurs, each 
MAF entry is checked to sé 
Deache eee piock 


‘Regardless of the Dcache state, the Mbox forwards the data 
ache control and bus interface unit (BIU). 


peline creates a conflict if both the load and store operations access the 
Same memory location. (The store instruction has not yet updated the location 
when the load instruction reads it.) This conflict is handled by forcing the load 
instruction to replay trap; that is, the Ibox flushes the pipeline and restarts 
‘écution from the load instruction. By the time the load instruction arrives 


Preliminary Edition-September 1994 2-11 


at the Deache the second time, the conflicting store instruction has written the 
Deache and the load instruction is executed normally. 
























Replay traps can be avoided by scheduling the load instruction to i 


2.1.4.4 Write Buffer 


The Mbox contains a write buffer that has six 32-byte 
buffer provides a finite, high-bandwidth resource for re 
minimize the number of CPU stall cycles. The write b: 
WMB instruction are described in Section 2.7. 


2.1.5 Cache Control and sei Interface cat 


titerface functions, 
iehack 5 caching. It controls 
¥d-level backup cache 
imary Deache read misses, 
ite buffer into the shared 


also coubrols the 128-bit bidirectional 
Chapter 4 describes the external 


& primary data cache (Dcache), a primary 
instruction cache (Icaé] econd-level data and instruction cache 
(Scache). All me: 


CMOS structure: 


: he (Deache) i is a dual-read-ported, single-write-ported, 8K-byte 
is # write-through, read-allocate, direct-mapped, physical cache with 
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2.1.6.2 Instruction Cache 


The instruction cache (Icache) is an 8K-byte, virtual, direct-mapped ‘éache. 
Each block tag contains: 












A 7-bit address space number (ASN) field as defined by th 
architecture 


e A 1-bit address space match (ASM) field as defined by 
architecture 


e A 1-bit PALcode (physically addressed) eee 


memory. 


2.1.6.3 Second-Level Cache 


nstructions. The Tbox aintaine state for all pipeline ciaes to track 
* outstanding register write operations. 
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Figure 2-2 shows the ma Pe memory reference, and floating-péint 














used as a source operand of a following instruction before it is 
register file. 


Tables 2—1, 2-2, 2-3, 2-4, 2-5, and 2-6 provide examples ofc ave 
stages of pipelining during instruction execution. 


Table 2-1 Pipeline Examples—All Cases | 
Pipeline Stage Events 






0 Access Icache ie 38 and data. 
1 
2 
capable of executing hon : 
in this che cannot issue simt 
3 


Table 2-2 Pipeline 
Pipeline Stage 
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Figure 2-2 Instruction Pipeline Stages 
























Instruction Cache Read 


Instruction Buffer, Branch Decode, 
Determine Next PC ‘ 


Slot by Function Unit 





Register File Access Checks 
Integer Register File Acce: 


Integer 
Operate 


ee mplete in pipeline 
First Integer ey). C-MOV completes 


Operate Stage 


=. se . 2. 
vole latency. 
If Needed, Second Integer < | 4 ees ; 
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Table 2-3 Pipeline Examples—Floating Add 
Pipeline Stage Events 


Read the FRF. 

First stage of Fbox add pipeline. 
Second stage of Fbox add pipeline. 
Third stage of Fbox add pipeline. 
Fourth stage of Fbox add pipeline. Write the” 
















omonN Daa fs 


Pipeline Stage 


4 

5 tag store access. Detect Deache hit. 
Format the data ache arbitration defaults to pipe EO 
in anticipations 

6 Write the IR. 
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Table 2-5 Pipeline Examples—Load (Dcache Miss) 
Pipeline Stage Events 


4 Calculate the effective address. Begin the Deache d 
access. 


5 Finish the Deache data and tag store access.<! 


10 
the data as aud 


Finish the Deac 








les while: “waiting for a resource or stalling for other reasons. Dynamic 
tages (Ebox and Fbox) always advance state and are unaffected by any stall in 
the pipeline. A pipeline stall may occur while zero instructions issue, or while 
ome instructions of a set of four issue and the others are held at the issue 
tage. A pipeline stall implies that a valid instruction is (or instructions are) 
presented to be issued but cannot proceed. 
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Upon satisfying all issue requirements, instructions are issued into theik. 
slotted pipeline. After issuing, instructions cannot stall in a subsequent: 
pipeline stage. The issue stage is responsible for ensuring that all resoure 







means of stopping instructions after the issue stage is an abort 
term abort as used here is different from its use in the Alpha, 
Reference Manual.) 
2.2.2 Aborts and Exceptions 


Aborts may result from a number of causes. In gener# 


the preceding paragrap si the nature of some exception conditions, this 
may occur as lates#s:th : e register file (IRF) write cycle. In the case of 


of the exceptional instruction or the immediately 
a8 latched in the EXC_ADDR internal processor 
.In the case of an arithmetic exception, EXC_ADDR contains 
he instruction immediately after the last instruction executed. 


._ Inizediately following the last ietrastion executed. For the remaining cases, 
ADDR points to the exceptional instruction; where in all cases its 
tion should naturally restart. 
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outstanding write operations to both the IRF and FRF have completed 
outstanding instructions have passed the point i in the pipelin s 


check. 


Replay traps are aborts that occur when an instruction & 
that is not available at some point in the pipeline. 
resources whose availability could not be anticipats 


he following sections define the classes of instructions and provide rules for 
struction slotting, instruction issuing, and latency. 
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2.3.1 Instruction Class Definition and Instruction Slotting 


The scheduling and multiple issue rules presented here are performancé 
related only; that i is, there are no functional dependencies related to sche 
























determine which instructions can issue in a given cycle. 


Table 2-7 Instruction Classes and Slotting 
Class Name __s Pipeline Instruction List 
LD E0! or E1? All loads except L 


ST EO 
MBX EK0 
RX EO 
MXPR EO or E1 
(depends on 
the IPR) x 
IBR El integer conditional branches 
FBR FA? poli t:conditional branches 
JSR E1 to-subroutine instructions: JMP, JSR, RET, or 


QROUTINE, BSR, BR, HW_REI, CALLPAL 






IADD “ADDL, ADDL/V, ADDQ, ADDQ/V, SUBL, SUBL/V, 
SUBQ, SUBQ'V, S4ADDL, S4ADDQ, S8ADDL, 
S8ADDQ, S4SUBL, S4SUBQ, S8SUBL, S8SUBQ, 
EDA, LDAH 

ILOG AND, BIS, XOR, BIC, ORNOT, EQV 


SLL, SRL, SRA, EXTQL, EXTLL, EXTWL, EXTBL, 
EXTQH, EXTLH, EXTWH, MSKQL, MSKLL, 
MSKWL, MSKBL, MSKQH, MSKLH, MSKWH, 
INSQL, INSLL, INSWL, INSBL, INSQH, INSLH, 
INSWH, ZAP, ZAPNOT 


(continued on next page) 
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Table 2-7 (Cont.) Instruction Classes and Slotting 


Class Name __s Pipeline Instruction List 

CMOV EK0 or E1 CMOVEQ, CMOVNE, CMOVLT, CMé 
CMOVGE, CMOVLBS, CMOVL 

ICMP EO or E1 CMPEQ, CMPLT, CMPLE, C 
CMPBGE 

IMULL EKO MULL, MULL/V 

IMULQ E0 MULQ, MULQ/V 

IMULH EO UMULH 

FADD FA 


FDIV FA 
FMUL FM‘ 
FCPYS - FM or FA 
MISC — EO 
UNOP none 







4Fbox multiply pipeline. 
5UNOP is LDQ_U R31,0(Rx). 


sfotting 





static Sa, 
slotting functi 


The next integer instruction! in this INT16 can issue only in EO. 


n this context, an integer instruction is one that can issue in one or both of E0 or E1, 
but not FA or FM. 
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If the current instruction is one that can issue in either FA or FM, assign 
it to FA unless FA is not free. If it is an FA-only instruction, it must 
assigned to FA. If it is FM-only instruction, it must be assigned to FM 



















next sequential instruction. Stop when an instruction cann 
in an execution pipeline because any pipeline it can use is 

















when a rule would be violated by allocating the next i ; 
pipeline: 


e An instruction of class LD cannot be simult 
instruction of class ST. 





After a predicted not-taken IBR or FBR; 
can be slotted together. 


one cycle. The second two are sent only when the first 
fe..both issued, provided no other slotting rule would prevent the 
‘wo from being slotted in the same cycle. 
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2.3.2 Coding Guidelines 













stalls (split-issue), thus preventing [c] and [d] from advan¢é 
stage: 


Code example showing bad ordering 
(1) [a] LDL R2,0(R1) 
(3) [b] ADDL R2,R3,R4 


(4) [c] ADDL R2,R5,R6 
(5) [d] ADDL R4,R8,R9 









ducing iSbruction. Most latencies are only a property of the producer. An 
ception is integer multiply latencies. There are no variations in latency due 
which particular unit produces a given result relative to the particular unit 
that consumes it. In the case of integer multiply, the instruction is issued at 


plit-issue i is the situation in which not all instructions sent from the slotting stage 
to the issue stage issue. One or more stalls result. 
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is dependent on which previous instructions produced its operands and 
they executed. 


Table 2-8 Instruction Latencies 





4o Integer 
Class Latency 


LD Deache hits, latency=2. 
Deache miss/Scache hit, latency=8 or longer.” 





ST Store operations produce no result. 


MBX LDx_L always Deache misses, latency depends 
memory subsystem state. 
STx_C, latency depends on memo 
state. ; 















RX RS, RC, latency=1. eycles 
MXPR HW_MFPR, latency=1, 2, or lon; 1 or 2 cycles 
the IPR. 
HW_MTPR, produces no resul: 
IBR Produces no result. (Taken:} — 
minimum = 1 cycle, brane 
5 cycles.) & 
FBR = 
5 cycles.) : : 
JSR All but HW_REI, lateticy=1. : 2 cycles 
IADD 2 cycles 


1The multiplier is unakié to 
but its latency is ineréased by the: 
example, an Ednstruction iss 
(8. + 


arbitratié#: predicts a load miss in EO. If a load actually does miss in EO, it is sent to 
Hately. If it hits, and no other event in the Cbox affects the operation, the requested data is 
in eight cycles. Otherwise, the request takes longer (possibly much longer depending on the 
the and Cbox). It should be possible to schedule some unrolled code loops for Seiche by using 
attern that takes advantage of the Mbox load-merging function, achieving high throughput 


c Ebox bypass paths. The instruction issues at the expected time, 
akes for the input data to become available to the multiplier. For 

i e cycle later than an ADDL instruction, which produced one of its 

i 2). If the IMULL instruction is issued two cycles later than the ADDL 


















(continued on next page) 
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Table 2-8 (Cont.) Instruction Latencies 
























Additional Time B 


Class Latency 

ILOG Latency=1.* 

SHIFT Latency=1. 

CMOV Latency=2. 

ICMP Latency=1.* 

IMULL Latency=8, plus up to 2 cycles of added latency 


depending on the source of the data. 

Latency until next IMULL, IMULQ, or I 
instruction can issue (if there are no dat. 
dependencies) is 4 cycles plus the numbé 
cycles added to the latency. 


IMULQ 1 cycle 
Latency until next IMULL, IMULQ, * 
instruction can issue (if there.are..no 
dependencies) is 8 cycles phi 
cycles added to the latency. ' 
IMULH 1 cycle 
FADD — 
FDIV = 


ed in the same ee The result 
¢ ivide is available, regardless of 


able to receive data from Ebox bypass paths. The instruction issues at the expected time, 
aaed nt the time it takes for the input data to become available to the multiplier. For 
ction issued one cycle later than an ADDL instruction, which produced one of its 
+O (8 + + If the LL instruction is issued two cycles later than the ADDL 





ypass prove an effective latency of 0 (zero) cycles for an ICMP or ILOG instruction producing 
an IBR or CMOV instruction. This is true only when the IBR or CMOV instruction issues 
cycle as the ICMP or ILOG instruction that produced the test operand of the IBR or CMOV 


all other cases the effective latency of ICMP and ILOG instruction is one cycle 


(continued on next page) 
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Table 2-8 (Cont.) Instruction Latencies 


Additional Time Beforé 
Result Available.to Int 


Class Latency Multiply Unit! 


FCPYS Latency=4. — 
MISC RPCC, latency=2. TRAPB produces no result. 1 cycle 
UNOP UNOP produces no result. — 


1The multiplier is unable to receive data from Ebox bypass paths. The instructéen i the expected time, 
but its latency is increased by the time it takes for the input data to become let Xplier. For 
example, an MULL instruction issued one cycle later than an ADDL instructiai; whith prodiicéd one of its 
operands, has a latency of 10 (8 + 2). If the IMULL instruction is issued two cycles: ater thin the ADDL 


instruction, the latency is 9 (8 + 1). 





2.3.3.1 Producer—Producer Latency 


peas ina mult 
cases, producer—cons 
producer-produ 







“wr-rd conflict stalls execution Waiting for R2 
; wxr-wr conflict may dual issue when ADDQ issues 





instructions, even though their latency is 2. For IMUL, FDIV and LD 
é ictions, producer—producer conflicts with any subsequent instruction 
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as it is guaranteed to write the register file at least one cycle after th 
FDIV, or LD instruction. 














If a load writes a ses and within two cycles a subsequ 


pea This causes the second instruction to be replay 
When the second instruction again reaches the issue. 
until the load fill occurs. 


2.3.4 Issue june 


instruction: 


e No instruction can be seta patil al 






was predicted to hit 
instruction iggued. Bt 










reas Sd _C (or HW_ST-cond) instruction has been issued until 
“Mbox writes the success/failure result of the STx_C (HW_ST-cond) in 
ination register. 


No IMUE instructions can be issued if the integer multiplier is busy. 


No floating-point divide instructions can be issued if the floating-point 
divider is busy. 


No instruction can be issued to pipe EO exactly two cycles before an integer 
‘multiplication completes. 
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¢ No instruction can be issued to pipe FA exactly five cycles before a 
point divide completes. 


are prevented from issuing siiomatieall a 
in order. 






2.4 Replay Traps 


hen a store instruction is executed and there are 
fitries allocated. ‘The trap occurs even if the entry 


id. The trap occurs even if the load instruction would 
in the Deache or merged with an MAF entry. 
















1 ‘shared memory model order trap (Litmus test 1 trap): If a load 
struction issues that address matches with any miss in the MAP, the 
Joad instruction is aborted through a replay trap regardless of whether the 
newly issued load instruction hits or misses in the Deache. The address 
atch is precise except that it includes the case in which a longword 

ce ss matches within a quadword access. This ensures that the two loads 
‘écute in issue order. 
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and both access the same location. The address match is exact wi 
to low-order bits of the address, but ignores address bits <4 







that uses the result of that load, and the load misse <4 
the consumer instruction traps and is restarted from t 
the pipeline. This occurs because the sie eee ‘Sti 


2.5.1 Merging Rules 


When a load miss occurs, e 
load miss that addresses th 
merging rules are satisfie 






fily with other even longword load instructions, and longword load 
instructions with odd addresses merge only with other odd longword load 
instructions. 


The MAF does not merge floating-point and integer load misses in the 
same entry. 
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¢ Merging is prevented for the MAF entry a certain number of cycles gfter 
the Scache access corresponding to the MAF entry begins. Merging‘i 
prevented for that entry only if the Scache access hits. The minimu 
number of cycles of merging is three; the cycle in which the 
is issued, and the two subsequent cycles. This corresponds 
optimistic case of a load miss being forwarded to the Scac 
(accounting for the cycle saved by the bypass that sends 
directly to the Scache when there is nothing else pending) 


2.5.2 Read Requests to the Chox 


When merging does not occur, a new MAF entry is al 
miss. Merging i is done for two load instructions is ued | si 










0X aeeapts the reference. This permits the system 
'y those INT8s that are actually requested by load 


each: uch register i in a separate INT8 in memory. 
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2.5.4 MAF Entries and MAF Full Conditions 





trap occurs again. 


2.5.5 Fill Operation 
Eventually, the Chox provides the data req 














bubble prevents any instruction fri i 
Mbox instructions (particularly load q 






ni the Ebox/Mbox pipelines to 
format the data and load the regi le. “Téizses the second bubble cycle to fill 
the Deache. P 


An instruction typically wri le in pipeline stage 6 (see 


ivcone register file write port per integer 












bbles, followed by one no-Mbox-instruction bubble. The 
requested speculatively before it is known whether the Scache or 
xternal Beache will hit. 


‘or fills from the Chox to floating-point registers, no cycle is allocated. Load 
instructions that conflict with the fill in the pipeline are forced to miss. Store 
instructions that conflict in the pipeline force the fill to be aborted in order 
keep the Dcache available to the store operation. In all cases, the floating- 
xoint registers are filled as dictated by the associated MAF entry. The Fbox 
has separate write ports for fill data as is necessary for this fill scheme. 
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Up to two floating or integer registers may be written for each Chox fil, 
cycle. Fills deliver 32 bytes in two cycles: two INT8s per cycle. The MA: 
merging rules ensure that there i is no more than one register to yh for 






provided there is a miss recorded for that INTS8. 


Load misses are all checked against the write buffer contents for 
between new load instructions and previously issued sto 






illy sent the 
-correct ordering 






1. Reading the Deache tag 
load instruction ould ri 











‘ Deache data store in the same early stage that, 
ache tag store. Therefore, a load instruction would conflict 
ee: if it were issued in the second cycle after any store 


ad instruction that is issued one cycle after a store instruction in the 

ine creates a conflict if both access exactly the same memory location. 
urs because the store instruction has not yet updated the location 
e load instruction reads it. This conflict is handled by forcing the 
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load instruction to replay trap. The Ibox flushes the pipeline and reétarts 

execution from the load instruction. By the time the load instructior:arrives 
at the Dcache the second time, the conflicting store instruction has wii 
Deache and the load instruction is executed normally. 


Software should not load data immediately after storing it. 
that is incurred “costs” seven cycles. The best solution is ¢é 
instruction to issue three cycles after the store. Noi issue st 
will occur in that case. If the load instruction is sc 
after the store instruction, it will be issue-stalled ft 
an optimal solution, but is preferred over incurri 
instruction. 


















For three cycles ee store instruction ex, g 


applies to fills of floating-point ce 
such that an integer fill never conflic 
stages 4 or 5. Instead, a store i in ae i 








¢tion from being issued until all conflicting load instructions 
been issued to the Chox. 







ita from being cached in the Deache. 


check is performed for each new store against store instructions in the write 
buffer that have already been sent to the Cbox but have not been completed. 
Section 2.7 describes this process. 
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2.7 Write Buffer and the WMB Instruction 
The following sections describe the write buffer and the WMB instruction 


2.7.1 The Write Buffer 
The write buffer contains six fully associative 32-byte entries 















CPU cycle. This is greater than the average rate at 
sale the data if Scache misses occur. 


FETCH, and FETCH_M instructions are also 
sent off-chip. However, unlike store pace 
iffer entry with other instructions. 
Ain one of these commands. 


2.7.2 The WMB Instruction 
The WMB instruction has a speciaf 








containing valid store data 
A merging with any of the 
a is marked with a WMB flag. At 


WMB instruction rete 
Each write-buffe 







‘and a pending request queue. The free-entry queue contains pointers 
. jlable invalid write-buffer entries. The pending-request queue contains 
ainters to valid write-buffer entries that have not yet been issued to the Chox. 
pending-request queue is ordered in allocation order. 
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Each time the write buffer is presented with a store instruction, the:physical 
address generated by the instruction is compared to the address in é4éh valid 
write-buffer entry that is open for merging. If the address is in the S 
INT32 as an address in a valid write-buffer entry (that also corte 


















data i is written into the entry at the top of the free- -entry que 


pending-request queue. 


2.7.4 Write-Buffer Entry processing 


iding-request queue 
‘hen the Cbox has completely 
‘Mbox, and the now invalid 


ns an STQ_C, STL_C, FETCH, or FETCH_M instruction 
: buffer is marked by a WMB flag 


This ensures that these instructions complete as quickly as possible. 
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Chox. 


Load misses are checked in the write buffer for conflicts. The'¢ 
this check is an INT32. Any load instruction matching any writé-Huff 
address is considered a hit even if it does not access an. ENE4 


is set in the load instruction’s MAF entry, which prev 

from being issued to the Cbhox before the conflicting wii 

been issued and completed. At the same time, the:no-mergt 

write-buffer entry with which the load hit. A w a ish: flag is also set. 
The Mbox continues to aa that write-b i processed until all 





° Total cycles 
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e Pipe dry 
e §6Pipe freeze 
¢ Mispredicts and cache misses 


¢ Counts for various instruction classifications 


e Hardware interrupt clear (HWINT_CLR) regis 
e Interrupt summary register (ISR) (see S 


MLO-011301 


Description (Meaning When Set) 


Summary bit (SUM). Records bitwise OR of FPCR exception bits. Equal to 
PCR<57 | 56 | 55 | 54 1 53 | 52> 


nexact disable (INED). Suppress INE trap and place correct IEEE 
nontrapping result in the destination register if the 21164 is capable of 
producing correct IEEE nontrapping result. 








(continued on next page) 
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Table 2-9 (Cont.) Floating-Point Control Register Bit Descriptions 
























Bit Description (Meaning When Set) 

<61> Underfiow disable (UNFD). Subset support: Suppress UNE, 
is also set and the /S qualifier is set on the instruction. 

<60> Underflow to zero (UNDZ). When set together with UN# 


the hardware places a true zero (all 64 bits zero) in the 


<59,58> 
00 
01 Minus infinity 
10 Normal rounding: 
11 Plus infinity = 
<57> etic operation or a conversion 
ed the destination precision. 
<56> rithmetic or conversion operation gave a 
8 the emetiomatiealy exact result. 
<55> i: 
<54> 
<53> 


<52> (INV). An attempt was made to perform a floating 


rsion, or comparison operation, and one or more of the 
were illegal. 


erflow disable (OVFD). Not supported. 
1 sion by zero disable (DZED). Not supported. 
Invalid operation disable (INVD). Not supported. 


Reserved. Read as zero; ignored when written. 
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2.10 Design Examples 


The 21164 can be designed into many different uniprocessor and multipte 
system configurations. Figures 2-4, 2-5, and 2-6 illustrate three oss ; 
configurations. These configurations employ additional syste 
controller chipsets. 














Figure 2—4 shows a typical uniprocessor system with a boai 
system configuration could be used in standalone or networke 


Figure 2-4 Typical Uniprocessor Configuration 


: vores a interface controller must employ a duplicate tag store 
maintain encke coherency. This system configuration could be used in a 
‘kéd:database server application. 
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Figure 2-5 Typical Multiprocessor Configuration 


21164 Cache 
_— 
ae Addr/cmd 
Extemal 
Cache 
Data 
Data 
Bus nu Set 






hultiprocessor system. This system configuration 
ndwith dedicated server applications. 
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Figure 2-6 Cacheless Multiprocessor Configuration 


21164 21164 
Bus Bus 
Interface Interface 






MLO-012977 
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Figure 3-1 Alpha 21164 Microprocessor Logic Symbol 


addr_bus_req_h 
data_bus_req_h 


’ fill_h 

fill_error_h 

fill_id_h 
fill_nocheck_h 
system_lock_flag_h 
idle_be_h 

shared_h 


irq_h<3:0> 
sys_mch_chk_irg h 
mch_hit_irq_h 
pwr_fail_irq_h 


osc_clk_in_h 
osc_clk_in_! 
ret_clk_in_h 
ck_mode_h<1:0> 






Test Modes and 
Miscellaneous 
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tag_ram_we_h 
data_ram_oe_h 
data_ram_we_h 
int4_valid_h<3:0> 
scache_set_h<1:0> 


cpu_clk_out_h 
sys_cik_outi_h 
sys_clk_out1_|! 
sys_clk_out2_h 
sys_clk_out2_| 


test_status_h<1:0> 
tdo_h 

tms_h 

trst_| 

srom_cik h 
srom_oe | 


srom_present_| 


MK145506 





















3.2 Alpha ation Signal ee and Functions 




















used for functional signals. There are three spare (unused) sigx 
remaining pins are used for power (104) and ground (101). « 


The following table defines the 21164 signal types referred: this sectio 


Signal Type Definition 

B Bidirectional 
I Input only 
O Output only 


The remaining two tables describe the fun: ion of éa¢h. aa external signal. 


descriptions. Table 3-2 lists igh hy. 
description. 


Table 3-1 Alpha 21164 Signal Descripti 
Signal Type 5 


addr_h<39:4> B 


addr_bus_req_h s bus request. The system interface uses this 
ignal to gain control of the addr_h<39:4>, addr_cmd_ 


par_h, and cemd_h<3:0> pins. 





addr_cmd_par_h .. Address command parity. This is the odd parity bit on 
he current command and address buses. The 21164 
takes a machine check if a parity error is detected. The 


system should do the same if it detects an error. 


(continued on next page) 


Preliminary Edition-September 1994 3-3 


Table 3—1 (Cont.) Alpha 21164 Signal Descriptions 
Count Description 


Signal 


addr_res_h<1:0> 


addr_res_h<2> 


cack_h 


cfail_h 


clk_mode_h<1:0> 


emd_h<3:0> 


Type 
O 


2 











Address response bits <1> and <0>. For. 


commands, the 21164 uses these pins t indicate GE 
state of the block in the Scache. : 


Command 
NOP 


01 NOACK 
ACK/Scache 


Bits 




















2. Thé system interface uses this 
y one of the commands driven 








im “event. This causes the 21164 to do a rere 
8t.and trap to the machine check (MCHK) PALcode 
“point, which indicates a serious hardware error. 


g ock test mode. These signals specify a relationship 
tween osc_clk_in and the CPU cycle time. These 
signals are not asserted in normal operation mode. 





Command bus. These signals drive and receive the 
commands from the command bus. The following 
tables define the commands that can be driven on the 
emd_h<3:0> bus by the 21164 or the system. For 
additional information, refer to Section 4.1.1.1. 


(continued on next page) 
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Table 3—1 (Cont.) Alpha 21164 Signal Descriptions 


Signal 


Type 


Count Description 


21164 Commands to System: 


cmd_h 
<3:0> 


0000 
0001 
0010 


0011 


Command 


NOP 
LOCK 
FETCH 


READ MISSO 
READ MISS1 
READ MISS MODO 


READ MISS MOD1 


BCACHE VICTIM 


READ MISS MOD 
STCO 


READ MISS MOD 
STC1 














F ster address. 


:ti64 passes a 
H instruction to 


The 21164 passes a 
FETCH_M instruction | 
to the system. 


MB instruction. 


Dirty bit set if shared 
bit is clear. 


Request to write a 
block. 


Request to write a 
block with lock. 


Request for data. 
Request for data. 


Request for data; 
modify intent. 


Request for data; 
modify intent. 


Bcache victim should 
be removed. 


Reserved. 


Request for data, 
STx_C data. 


Request for data, 
STx_C data. 


(continued on next page) 
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Table 3—1 (Cont.) Alpha 21164 Signal Descriptions 
Signal Type Count Description 















System Commands to 21164: 


cmd_h 
<3:0> Command 
0000 NOP 


0001 FLUSH 


0010 


0011 : 
“shared state. 


Read a block. 


Read a block; set 
shared. 


Read a block; 
invalidate. 


epu_clk_out_h 
dack_h 


clock Gté ut. This signal is used for test purposes. 


knowledge. The system interface uses this 
“te:control data transfer between the 21164 and 
ysterm: 


data_h<127:0> us. These signals are used to move data between 


» the 24164, the system, and the Beache. 


data_bus_req_h ita bus request. If this signal is asserted in sysclk 
n, then the 21164 does not drive the data bus in sysclk 
n+2. Before asserting this signal, the system should 
assert idle_bc_h for the correct number of cycles. If 
this signal is deasserted in sysclk n, then the 21164 
drives the data bus in sysclk n+2. For timing details, 
refer to the sections on bus transactions in Chapter 4. 


Data check. These signals set even byte parity or INT8 
ECC for the current data cycle. Refer to Section 4.13.1 
for information on the purpose of each data_check_h 
bit. 


(continued on next page) 
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Table 3—1 (Cont.) Alpha 21164 Signal Descriptions 
Signal Type Count Description 


data_ram_oe_h O 1 
Beache reads. 


data_ram_we_h O 1 Data RAM write enable. This sig 


Bcache write operation. Refer us 
details. 


dc_ok_h I 1 dc voltage OK. Must be ‘ 
reaches proper operating 
asserted. 
fill_h I 1 


Beac e in sysclk n+2. The 
fhat.sysclk. At the end of the 
n+1, the*81164 waits for the next 

ns the write again if dack_h is not 


fill_error_h I 1 






entry poity i. 
ror_h Sfeald | be asserted when the data is returned. 
a assertion produces a MCHK trap. 


Fill identification. Asserted with fill_h to indicate which 
fegister is used. The 21164 supports two outstanding 
“Téad instructions. If this signal is asserted in sysclk n, 
the 21164 provides the address from miss register 1. If 
it is deasserted, then the address in miss register 0 is 
used for the read operation. 


Fill checking off. If this signal is asserted, then the 
21164 does not check the parity or ECC for the current 
data cycle on a fill. 


Idle Beache. When asserted, the 21164 finishes the 
current Beache read or write operation but does not 
start a new read or write operation until the signal is 
deasserted. Systems must assert this signal in time to 
idle the Bcache before fill data arrives. 


Index. These signals index the Bcache. 





fill_id_h 





fill_nocheck_h 


(continued on next page) 
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Table 3—1 (Cont.) Alpha 21164 Signal Descriptions 










Signal Type Count Description 
int4_valid_h<3:0> O 4 INT4 data valid. During write operatio 


are used to indicate which INT4 bytes 6f:< 
This is useful for noncached write opé Z 


fiead Meaning 
data_h<63:0> valid 
data_h<127:64> valid 
data_h<191:128> valid 
data_h<255:192> valid 


ss. or both read and write operations, multiple 
int “valid_h<3:0> bits can be set simultaneously. 








(continued on next page) 
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Table 3-1 (Cont.) Alpha 21164 Signal Descriptions 
Signal Type Count Description 


irq_h<3:0> I 4 System interrupt requests. These sign 
i ak 


“ Machine halt interrupt request. This signal has 
multiple modes of operation. During initialization, 

this signal is used to set up sys_clk_out2_h,] delay (see 
Table 4-3). During normal operation, it is used to signal 
a halt request. 


mch_hlt_irg_h 


Oscillator clock inputs. These signals provide the 
differential clock input that is the fundamental timing 
of the 21164. These signals are driven at twice the 
desired internal clock frequency. (Under normal 
operating conditions the CPU cycle time is one-half 
the frequency of osc_clk_in.) 


(continued on next page) 
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Table 3-1 (Cont.) Alpha 21164 Signal Descriptions 
Type 


Signal 


perf_mon_h 
port_mode_h<1:0> 


pwr_fail_irg_h 


ref_clk in_h 


scache_set_h<1:0> 
shared_h 


srom_clk_h 


srom_data_h 


srom_oe_l 


srom_present_l 


I 


I 


Count Description 


1 














Performance monitor. This signal provides 
21164 internal performance monitoring: bardw 
off-chip events om as bus activity 















ITE B OCK LOCK command is acknowledged, 
can be used to keep the block status shared or 
e Scache. 


ROM SS 128 times the cycle time of the CPU clock. 
éFial ROM data. Input for the SROM. 





: Serial ROM output enable. Supplies the output enable 
» to the SROM. 


Serial ROM present. Indicates that SROM is present 
and ready to load the Icache. 


System clock outputs. Programmable system clock 
(cpu_clk_out_h divided by a value of 3 to 15) is used 
for board-level cache and system logic. 


(continued on next page) 
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Table 3—1 (Cont.) Alpha 21164 Signal Descriptions 





















Signal Type Count Description 
sys_clk_out2_h O 1 System clock outputs. The value of:sys 
sys_clk_out2_l O 1 


cycles. 
sys_mch_chk_irg_h I 1 
sys_reset_] I 1 System reset. This signal pr e 
damage eo inith 
system_lock flag h I 1 “the 21164 logically ANDs 


n copy with its own copy to 


tag_ctl_par_h B 


yati shoiild drive the correct parity ‘based on 
He: Valid, shared, and dirty bits. 


tag_data_h<38:20> 


data parity bit. This signal indicates odd parity for 
ata_ h<38:20>. 


Tag tty state bit. During fills, the system should 
assert this signal if the 21164 request is a READ MISS 
MOD, and the shared bit is not asserted. Refer to 
Table 4—6 for information about Bcache protocol. 


* Tag RAM output enable. This signal is asserted during 
any Bcache read operation. 


Tag RAM write enable. This signal is asserted during 
any tag write operation. During the first CPU cycle of 
a write operation, the write pulse is deasserted. In the 
second and following CPU cycles of a write operation, 
the write pulse is asserted if the corresponding bit 

in the write pulse register is asserted. Bits BC_ 
WE_CTL<8:0> control the shape of the pulse (see 
Section 5.3.5). 


tag_data_par_h 


tag dirty _h 





tag_ram_oe_h 


tag_ram_we_h:: 


(continued on next page) 
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Table 3-1 (Cont.) Alpha 21164 Signal Descriptions 


Signal 
tag_shared_h 


tag_valid_h 


tek_h 
tdi_h 
tdo_h 


temp_sense_h 


test_status_h<1:0> 


tms_h 
trst_l 


victim_pending_h 


















Type Count Description 


B 1 Tag shared bit. During fills, the system:s] 
this signal with the correct value to mavk 
block as shared. See Table 4—6 for ink . 
Beache ee 


B 1 
I 1 
I 1 
O 1 
I 1 
O 2 





stay 
if ICSR<39> is true, on Ibox 
x d if the Icache built-in 










="select signal. 
test access port (TAP) reset signal. 





im périding. When asserted, this signal indicates 
the current read miss has generated a victim. 








was | until the victim is removed. 
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Table 3—2 lists signals by function and provides an abbreviated descri 


























Table 3-2 Alpha 21164 Signal Descriptions by Function 

Signal Type Count Description 
Clocks 
clk_mode_h<1:0> 
cpu_clk_out_h 
ose_clk_in_h,l 
ref_clk_in_h 
sys_clk_outl_h,l 
sys_clk_out2_h,l 


sys_reset_l 


Clock test mode. 
CPU clock ou 


4Hoorrore 


Bcache 


data_h<127:0> BD 
data_check_h<15:0> B 
data_ram_oe_h 


ata:RAM output enable. 


data_ram_we_h Data RAM write enable. 








index_h<25:4> dex. 
tag_ctl_par_h Tag control parity. 
tag_data_h<38:26>. Beache tag data bits. 
tag_data_par_h Tag data parity bit. 
tag_dirty_h Tag dirty state bit. 
tag_ram_o« Tag RAM output enable. 
Tag RAM write enable. 
Tag shared bit. 
Tag valid bit. 


(continued on next page) 
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Table 3—2 (Cont.) Alpha 21164 Signal Descriptions by Function 
Signal Type Count Description 




















System Interface 


addr_h<39:4> B 36 Address bus. 
addr_bus_req_h I 1 Address bus request. 
addr_cmd_par_h B 1 Address command 
addr_res_h<2:0> O 3 Address response 
cack_h I 1 Command ackno’ edge: 
efail_h I 1 Command fail. ~ 
cemd_h<3:0> B 4 

dack_h I 1 

data_bus_req_h I 1 

fill_h I 1 

fill_error_h I 1 

fill_id_h I 1 

fill_ nocheck_h I 

idle_bc_h I ; 
int4_valid_h<3:0> O ‘ , data valid. 


Secondary cache set. 
:, Keep block status shared. 
” System lock flag. 


scache_set_h<1:0> 
shared_h 
system_lock_flag_h 
victim_pending_h Victim pending. 


Interrupts 






System interrupt requests. 
Machine halt interrupt request. 


Power failure interrupt request. 


= = = 


System machine check interrupt request. 


(continued on next page) 
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Table 3-2 (Cont.) Alpha 21164 Signal Descriptions by Function 
Signal Type Count Description 
Test Modes and Miscellaneous 
dc_ok_h 


perf_mon_h I 





dc voltage OK. 


Performance monitor. 


jas 


port_mode_h<1:0> I 


iW) 










JTAG test access port (TAP) reset. 





srom_clk_h O 

srom_data_h I 

srom_oe_l O 

srom_present_l B 

tck_h I 

tdi_h I 

tdo_h O |-boundary scan data out. 
temp_sense_h I ; ture sense. 
test_status_h<1:0> O I 

tms_h Bs IT. test mode select. 
trst_l B 
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Clocks, Cache, and oo nterface 





which includes the backup cache (Bcache); 
describes the clock circuitry, locks, inter, 





ink aces. It ala 
d ECC/parity 









Introduction to the external inte’ : 
Clocks 


Physical address considerations 






Beache structure and @ 


Cache coherency 





Locks mechantisr 0 


Chapter 3 lists and defines all 21164 hardware interface signal pins. Chapter 9 
describes the 21164 hardware interface electrical requirements. 
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4.1 Introduction to the External Interface 


A 21164-based system can be divided into three major sections: 



















¢ Alpha 21164 microprocessor 

¢ Optional external Bcache 

¢ System interface logic 
— Optional duplicate tag store 
— Optional lock register 
- Optional victim buffers 


The 21164 external interface is flexible and m 
allowing a wide range of prospective systems 
bidirectional data bus, a 36-bit pidieqaona = 
signals. 
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Figure 4-1 Alpha 21164 System/Bcache Interface 
























addr_h<39:4> 
addr_bus_req_h 
addr_cmd_par_h 
addr_res_ h<2:0> : 


cmd_h<3:0> 


fill_error_h 


fill nocheck_h 
idle_bc_h- 
int4_valid_<3:0> 








Bcache 
Interface 








pwr_fail_ irq h 
sys_mch_chk_irq_h 


MK-1455-04 
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4.1.1.1 Commands and Addresses 


The 21164 can take up to two commands from the system at a time. Th 
Seache or Bcache or both are probed to determine what must be done wi 
command. 


e If nothing is to be done, the 21164 acknowledges receivin 


e Ifa Beache read, set shared, or invalidate operation is r 
21164 performs the task as soon as the Bcache beco: 
21164 acknowledges receiving the command at thes 
transaction. 


There are two miss and two victim buffers i in the BI 


e An Scache victim occurs when the of 
Scache. 


4.1.2 Bcache Interface 


the:eache location to be filled with data from the desired address. Data in the 
tim buffers will be written to memory at a later time. This action reduces 
time that the 21164 is waiting for data. 


4—4 Preliminary Edition-September 1994 


4.2 Clocks 












Signal Description 





cpu_clk_out_h A 21164 internal clock that may or ney noé¢ 
sys_clk_out1_h,l 
sys_clk_out2_h,l_ A delayed copy of sys_clk_out1_h,l2The: 

and is an integer number of cpu_ 


The 21164 may use ref_clk_in_h as a reference c 
sys_clk_outl_h,l and sys_clk_out2_h,l. 


4.2.1 CPU Clock 


Description 










Normal Usual operation—CPU clock frequency 
is % input frequency. 
Chip test CPU clock frequency is the same as the 


input clock frequency to accommodate 
chip testers. 


4 CPU clock frequency is % input 
frequency to accommodate module 
testers. 


— Initializes CPU clock allowing system 
clock to be synchronized to a stable 
reference clock. 


Caution 


A clock source should always be provided on osc_clk_in_h, 1 when 
signal de_ok_h is asserted. 
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Figure 4-2 Clock Signals and Functions 














osc_clk_in_h, | CPU Clock 


x Digital 
clk_mode_h<1:0> a Divider 


ce a (1, /2, or /4) : PLL 


System Clock 
Divider 
(/3 through /15) 


cpu_clk_out_h 


ref_clk_in_h 
















sSsys_clk_outi_h,| 
irq_h<3:0> Se 


mch_hit_irq_h 
pwr_fail_irq_h 


sysmchchk ih |  —_ 


sys_reset_| 
dc_okay_h 


sys_clk_out2_h, | 
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4.2.2 System Clock 
The CPU clock is:the: 
outl1_h, 1. Thé:sys 
1. The divisor, 3 to, 


sé:elock used to generate the system clock sys_clk_ 
elock divisor controls the frequency of sys_clk_out1_h, 
, is obtained from the four interrupt lines irq_h<3:0> at 
m Table 4-2. The system clock frequency is determined by 
4¢ CPU clock frequency. 
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Table 4—2 System Clock Divisor 


irq_h<3> 
Low 
Low 
Low 
Low 
Low 
High 
High 
High 
High 
High 
High 
High 
High 


irq_h<2> 
Low 
High 
High 
High 
High 
Low 
Low 
Low 
Low 
High 
High 
High 


irq_h<1> 
High 


LJ-03676-TI10 


irq_h<0> 
High 
Low 
High 
Low 
High 


Low 




















Ratio 
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4.2.3 Delayed System Clock 


The system clock sys_clk_out1_h, 1 is the source clock for the delayed s; 
clock sys_clk_out2_h, 1. These clock signals provide flexible timin, 
use. The delay unit, 0 to 7, is obtained from the three interrupt.sii 
hit_irg_h, pwr_fail_irg_h, and sys_mch_chk_irg_h at powe 
Table 4—3. The output of this programmable divider is symmet 
is even. The output is asymmetric if the divisor is odd. 






















Table 4-3 System Clock Delay 
sys_mch_chk_irq_h pwr_fail_irg_h 


Low Low 

Low Low 

Low High 2 
Low High 3 
High Low 4 
High Low 5 
High High 6 
High High 7 


4.2.4 Reference Clock > 


to that reference cl k. 


“sys°clk_outl must be slightly higher, but no greater than 
ter, than that of signal ref_ elk_ in_ h. This causes the rising edge of 
dri 
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Figure 4—4 shows a multiprocessor 21164 system synchronized to a 
clock. 










Figure 4-4 Alpha 21164 Reference Clock for Multiprocessor Syster 


ref_clk_in sys_clk_out |. 


21164 


Reference 


Clock 





Memory 
ASIC 


ref_clk_in 


Bus 
ASIC 


LJ-03675-TIO 





has stabilized (20 cycles after irq_h<3:0> have 
delay before sys_clk_out1_h,l comes into lock with 
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4.2.4.1.1 Case 1: ref_cik_in_h Initially Sampled Low by DPLL When:the 

DPLL initially samples ref_clk_in_h in the low state, as shown in Figure:4—5, 
it slips its internal cycle repeatedly until it samples ref_clk_in_h in the high 
state. After it samples ref_ clk_in_h in the high state, the DPLL stays.in Iotk 
mode. 




















Figure 4-5 ref_cik_in_h Initially Sampled Low 


CPU Clock 
(Internal) 


sys_cik_out 1_h 


ref_cik_in_h 






internal cycle (the fr 
specification of 0. 35%. : ti average of 0.35% period is added to each 
sys_clk outl —b; 8 i Jock mode is reached. 


suming the worst case ref_clk_in_h duty cycle is 60/40 to 40/60: 


SettlingTime = 2S*RefCrackRertod — 171+ RefClockPeriod 
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Depending upon the sys_clk_out1_h,1 ratio, the DPLL may come into lock 
much more quickly. The DPLL may insert phase slips more frequent 
smaller sys_clk_out1_h,] ratios. 


4.2.4.1.2 Case 2: ref_cik_in_h Initially Sampled High by DPE : 
DPLL initially samples ref_clk_in_h in the high state, as 
it will not slip its internal cycle until it samples ref_clk. i 
After it samples ref_clk_in_h in the low state, the DPLL stay: 




















Figure 4-6 ref_clk_in_h Initially Sampled High 


CPU Clock 
(Internal) 


sys_cik_out 1_h 


ref_clk_in_h 


The rate at which 1 S¥8_ € 


SettlingTime = Se ee TA enero ornod 


Note 


The reference clock high ratio equals the portion of the ref_clk_in_h 
period that ref_clk_in_h is high. 


Preliminary Edition-September 1994 4-11 


Assuming the worst case ref_clk_in_h duty cycle is 60/40 to 40/60: 
















SettlingTime = 2S*ReferackPertod _ 444 « Re fClockPeri¢ 


4.3 Physical Address Considerations 
This section lists and describes the physical address regi 
wrapping characteristics of physical addresses are als@descr: 

4.3.1 Physical Address Regions 
Physical memory of the 21164 is divided into t 


1. The first region is the first half of the ph; 
by the 21164 as memory-like. 


pace. It is treated 





2. The second region is the second half‘o! 
for a 1M-byte region reserved for Cb 
noncachable. 


3. The third region is the 1M-b 






which INT8s are accessed. Write merging is 
32-byte requests with a mask indicating which 
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Table 4-4 Physical Memory Regions 
Region Address Range Description 


Memory-like 00 0000 0000- 























TF FFFF FFFF i¢ merging allowed. 
Noncacheable 80 0000 0000- Not cached, load mer, 
FF FFEF FFFFi¢ 
IPR region FF FFFO 0000— 
FF FFFF FFFF i¢ 


4.3.2 Data Wrapping 


are selected ih addr_h<5:4>. They, ar 
0, 1, 2,3 


, 0 
,3 
2 


? 


? 3 


od = 
Pow 
ore by 


? 9 


For 32-byte blocks, the 


rs are selected by addr_h<4>. They 
are: = 


0,1 
1, 0 


WRITE BLOCK a 
not wrapped.:Eh: 


‘E: BLOCK LOCK commands from the 21164 are 
write INT16 0, 1, 2, and 3. BCACHE VICTIM 





hysical addresses that have addr_h<39> asserted are 

in the Dcache, Scache, or Bcache. They are merged like other read 
je the miss address file (MAF). To prevent several read operations 
“noncached' memory from being merged into a single 32-byte bus request, 
software must insert memory barrier (MB) instructions or set MAF_ MODE 
IPR bit [IO_NMERGE]. The MAF merges as many Dstream read operations 
together as it can and sends the request to the BIU through the Scache. 
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Rather than merging two 32-byte requests into a single 64-byte requestgithe 














write the Dcache, Scache, or Bcache with the fill data. The re 
written in the register file or Icache. 


Note 


A special case using int4_valid -h<3:0> occurs du 






not written to any of the caches. These eu 
write buffer before being sent to the system: 


s , 32M-byte and 64M-byte Bcache. 
The size is under proégé se is ‘specified by BC_CONF<2:0>, 


(BC_SIZE<2:0>). 












or 64-byte blocks. The block size must be the same 
g SC_CTL<12>, [SC_BLK_SIZE]. 


The 21164 partitions physical address (addr_h <39:5>) into an index field and 
..ackag field. The 21164 presents index_h <25:4> and tag_data_h<38:20> to 
Bcache interface. 
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The system designer uses the signal lines needed for a particular size. Bcache. 


For example the smallest Bcache (1 MB) needs index_h <19:4> to address the 
cache block while the tag field would be tag_data_h<38:20>. 
















Only those bits that are actually needed for the amount of cact é = 


relevant tag address bits for that Beache size on its tag ca 
Bcache uses more index bits and fewer tag address bits 





requires two data cycles for a 32-byte block or four: 
block. : 


4.4.1 Duplicate Tag Store 


bus to the 21164. 


In systems without a Beache it is po 
Scache tag store and to model th 


4.4.1.1 Full Duplicate Tag Store 
The complete Bcache duplic: ‘suld contain an entry for each 


co entry would contain state bits for 


. ay three Scache sets. It should also have two 
ictim buffers. Figure 4—7 is a simplified diagram 
‘es of interest. 
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Figure 4-7 Full Scache Duplicate Tag Store 













scache_set_h<1:0> 


addr_h<14:6> 
(Index) 


tag_shared_h, 


tag_dirty_h, 
tag_valid_h 


addr_h<39:15> 
(Tag Data) 


victim_pending_h 


MLO-012395 
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The system should use the algorithm shown in Figure 4—8 to maintain the 
duplicate tag store. 







Figure 4-8 Duplicate Tag Store Algorithm 


Push new entry into 
duplicate tag sto 
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4.4.1.2 Partial Duplicate Tag Store 
System designers may also choose to build a partial duplicate tag atone 


for each block in the Scache, and for the two victim buffers insi 
If a system bus transaction hits in the partial duplicate tag sto: . 

block may be in the Scache. If a system bus transaction misség:in the parla 
duplicate tag store, then the block is not in the Scache. 


Figure 4-9 Partial Duplicate Tag Store 


scache_set_h<1:0> 


addr_h<14:6> 
(Index) 


addr_h<m:n> 


(Part of <39:15> Tag Data) 


victim_pending_h © 
MLO-012397 


4.5 Cache Cohereti¢y. 


Deag 2 ® and the a 


the system does not have a Beache the system designer must create 
chanisms i in the system interface logic to support cache coherency between 
he, main memory, and other caches in the system. 
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If the system has a Bcache, the 21164 maintains cache coherency befween the 











system. 


4.5.1 Cache Coherency Basics 
(Figure 4—10). 


Figure 4-10 Cache Subset Hierarchy 


Main Memory 


MK-1455-01 
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Tasks that must be performed to maintain cache coherency follow: 


e The Chox in the 21164 maintains coherency in the Deache and keeps‘i€: 
a subset of the Scache. 


e¢ If an optional Bcache is present, then the 21164 maintains 
subset of the Bcache. The Scache is set associative but is 
the larger externally implemented direct mapped Beache. 









memory system. 


The 21164 requires the system to allow only 






moved. 


port: several cache coherency 
classes: write invalidate 


The 21164 includes hardware mechanisms 
protocols. The protocols can be sepg 









is not sé th e scualule designer may include an Scache duplicate 
eve system performance. 









*6tocol does not allow shared data in the cache. 


protocol does not require a duplicate tag store. Because the duplicate tag 
is optional for this protocol, the Bcache is probed for each transaction to 
ne if the block is present. If the block is present, the requested action 
is taken; if the block is not present, the command is ignored. 
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arene in more detail while Section 4.5.4 and Section 4.5.5 provide 
detailed description of flush cache coherency protocol. 












Scache 





Duplicate Lock 
Cache Protocol Scache Tag Register 
Write invalidate (1) Yes No 
Write invalidate (2) Yes Required 
Write invalidate (3) Yes Required 


Write invalidate 1 


at the cache i is larger. 
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4.5.3 Write Invalidate Cache Coherency States 


Each processor in the system must be able to read and write data as if 4 
transactions were going onto the system bus to memory or I/O modules. 
Therefore, the system bus is the point at which cache coherency, must b 
maintained. 


























for 21164 systems. 


Table 4-6 Bcache States for Cache Coherency Protaéo 
Valid' Shared' Dirty’ State of Cache Line 


0 xX x Not valid. 
1 0 0 


Note 


s tems, the 21164 will not take an update to a 
ad will invalidate the block. 


Unlike SO: 
shared:bloe 
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4.5.3.1 Write Invalidate Protocol! State Machines 


Figure 4—11 shows the 21164 cache states that can occur as a resu 
transactions to the system. 









Figure 4-11 Write Invalidate Protocol 21164 States 


READ @) 
(CPU Read Operation) 






WRITE BLOCK G) 
(CPU Wite Operation) 













MLO-012934 


“1,8,D” indicate the INVALID, SHARED, and DIRTY 
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Figure 4—12 shows the 21164 cache states changes maintained by the 23164 as 
a result of transactions by other nodes on the system bus. 


Figure 4-12 Write invalidate Protocol System/Bus States 


INVALIDATE 
(us Wite Operation) 


READ DIRTY 
F (Bus Read Operation) 


SET SHARED 
(Bus Read Operation) 


READ DIRTY 
(Bus Read Operation) 


MLO-012935 


4.5.4 Flush Cache Cohi 


All 21164-based system: 
the combination: 


| Systems 
mptement the flush cache protocol must have 


ts listed in Table 4-7. For example, a system 
; having a Bcache and a Bcache duplicate tag 


a lock register. 
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Table 4-7 Components for 21164 Flush Cache Protocol Systems 



















Scache Bcache 
Duplicate Duplicate . 
Cache Protocol Scache Tag Bcache Tag 


Flush protocol (1) Yes No No 
Flush protocol (2) Yes No Yes 
Flush protocol (3) Yes No Yes 


ue based 1 


mechanism statis.” 


Flush-based 2 


This system has an extert 
System logic and 21164. 5 
system. 


Flush-based:3 
This ce h 


em “ = updates its lock mechanism status. 
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4.5.5 Flush-Based Protocol State Machines 


Figure 4-13 shows the 21164 cache states that can occur as a result of | 
transactions with the system. 















Figure 4-13 Flush-Based Protocol 21164 States 


READ MISS MOD 
(CPU read for. 
Wite intent operation.) 


READ 
(CPU Read Operation) 






SET DIRTY” : 
(CPU Wite Operation) 


* 


Optionally this transition can be configured te 





FLUSH 
(DMA Wite Operation) 






READ 
Read Operation) (DMA Read Operation) 
MLO-012936 
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4.5.6 Cache Coherency Transaction Conflicts 


Cache coherency conflicts that can occur during system operation ar described 
here. Systems should be designed to avoid these conflicts. 





4.5.6.1 Case 1 
If the 21164 requests a READ MISS MOD transaction, its 













might cause a multiprocessor system to have live-lee 
that can cause long delays in writing from the 21164 


4.5.6.2 Case 2 
If the 21164 attempts to write a cleanvprivat 






erefore, if the bus has already 
he dirty bit is incorrect. The 


; mand‘hen the ownership of the | 
: ill be restarted and will use the 
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4.6 Locks Mechanisms 


The LDx_L instruction is forced to miss in the Dcache. When the Scach 
read, the BIU’s lock IPR is loaded with the physical address and thelock # fi 



















displaced from the cache system. 
The lock flag is cleared if any of the following events occ 


e Any write operation from the bus addresses the lo: 
INVALIDATE, or READ DIRTY/INV). 


e An STx_C is executed by the processor. 


¢ The locked block is refilled from memory 
cleared. 


Mishennis 


duplicate tag store to filter write traffic.“4 
and Beache; along with the subsetting rul 


thrashing of the locked block. Ea 
the value of the lock bs dean iss} 


Systems that do not: 
21164, are not requ 
should tie signal 





x begins i issuing memory instructions again. If the block is in the shared 
_ the BIU requests a WRITE BLOCK transaction. If the system CACKs 
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When the write buffer arrives at an STx_C instruction in ee ee 














the STx_C fails. 


4.7 21164-to-Bcache Transactions 


The 21164 interface to the system and Beas 
provides address and control one ae a 


bidirectional data bus. 


The 21164 controls all Bceache transact 
_ write to the Bcache with no assistance 

reads or writes to the Beache, # Hi 

only under the direct control | 


4.7.1 Bcache bine 
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4.7.2 Beache Read Transaction (Private Read Operation) 


Figure 4—15 shows an example of the timing for a private read operation i 
Beache by the 21164. The read speed is 4 because BC_CONFIG [ 
is set to 4, defaulting to the minimum read speed of 4 CPU cycle: 


Figure 4-15 Bcache Read Transaction 


data_h <127:0> 


tag_ram_oe_h 


the data associated*% 


The 21164 alway beter asserting the tag_ram_oe_h and 


les are deasserted after the fourth index address 
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4.7.3 Wave Pipeline 


The wave pipeline is implemented to improve performance for syste 










The wave pipeline is controlled using BC_CONFIG<7:4> [B 
and BC_CTL<18:17> [BC_WAVE<1:0>]. 


Arrows indicate when 21164 
clocks Bcache data into the 
pad ring. es an 


CPU Clock cycles | 





index_h <25:4> 


MLO-012430 
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4.7.4 Bcache Write Transaction (Private Write Operation) 


Figure 4—17 shows an example of the timing for a private write operatio 
Beache by the 21164. The write speed is 4 because BC_LCONFIG [BC_WR=: 
SPD] is set to 4, defaulting to the minimum write speed of 4 CPU. 








Figure 4-17 Bcache Write Transaction 









ad or write speed is 15 cycles. The minimum read or 
cept that in 32-byte mode the minimum read speed is 


She write enable signals will not be asserted. If the 9-bit field 
0 1FFj¢, then the write-enable signals will be asserted for 9 CPU cycles. 
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4.7.5 Selecting Bcache Options 


Table 4—8 lists the variables to consider when designing and implem ting a 
Beache. : 


















Table 4-8 Bcache Options 
Parameter Selection 


Sysclk ratio (3-15) 

Cache protocol, write invalidate or flush 

Cache block size 64/32 

ECC or byte parity 

Beache present? 
Bceache size (1 to 64M byte) 
Bcache read speed (4-15) 
Beache wave pipelining (0-3) 
Beache victim buffer? 
Bcache write speed (4-15) 


Beache read to write spacir 
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4.8 21164-Initiated System Transactions 


This section describes how commands are used to move data in and out 














It encounters a “miss”. 

A LOCK command is invoked. 
A WRITE command is directed at a shared block. 
A WRITE command is directed at a clean block i 


At the start of a Bceache a A 
control status of the target block. 


At a later time the: 
The 21164 wi : 
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Table 4-9 21164-Initiated Interface Commands 






















cmd_h 

Command <3:0> Description 
NOP 0000 The NOP command is driven bythe own 

cemd_h bus when it has no tas! ‘queued. 
LOCK 0001 The LOCK command is used t6 d the system | “ 
FETCH 0010 
FETCH_M 0011 
MEMORY 0100 
BARRIER 
SET DIRTY 0101 


Bt 
‘the. SET DIRTY command when it wants to write 
a ¢igan, private block in its Scache and it wants 
the dirty bit set in the duplicate tag store. The 
21164 does not proceed with the write until a CACK 
sponse is received from the system. When the 
“ACK is received, the 21164 attempts to set the 
dirty bit. If the shared bit is still clear, the dirty 
bit is set and the write operation is completed. If 
the shared bit is set, the dirty bit is not set and the 
21164 requests a WRITE BLOCK transaction. The 
copy of the dirty bit in the Beache is not updated 
until the block is removed from the Scache. 


Request to write a block. When the 21164 wants 
to write a block of data back to memory, it drives 
the command, address, and first INT16 of data on 
a sysclk edge. The 21164 outputs the next INT16 
of data when dack_h is received. When the system 
asserts cack_h, the 21164 removes the command 
and address from the bus and begins the write of 
the Scache. Signal cack_h can be asserted before 
all the data is removed. 


(continued on next page) 
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Table 4-9 (Cont.) 21164-Initiated Interface Commands 


Command 


WRITE BLOCK 
LOCK 


READ MISSO 


READ MISS1 


READ MISS 
MODO 


READ MISS 
MOD1 


BCACHE 
VICTIM 


cmd_h 
<3:0> 


0111 


1000 


1001 


1010 


1011 



























Description 


Request to write a block with lock. ; i 
is identical to a WRITE BLOCK co: mand ex 


space. 
Request for data. This ca 


im buffér in the system, this command is used 
s the address of the victim to the system. The 
RE? E MISS command that produced the victim 
precedes the BCACHE VICTIM command. Signal 


MISS command to indicate that a BCACHE VICTIM 
command is waiting, and that the Beache is starting 
=the read of the victim data. 


If the system does not have a victim buffer, the 
BCACHE VICTIM command precedes the READ 
MISS commands. The BCACHE VICTIM command 
is driven, along with the address of the victim. At 
the same time, the Bcache is read to provide the 
victim data. 


(continued on next page) 
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Table 4-9 (Cont.) 21164-Initiated Interface Commands 


cmd_h 
<3:0> 





Command Description 









The system must usé the valiés 6f.ad. 
that were sent with the READ MISS:command that 













— 1101 
READ MISS 1110 
MOD STCO 
READ MISS 1111 
MOD STC1 
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4.8.1 READ MISS—No Bcache 


A read operation to the Deache misses causing a read operation to the Scath 
which also misses. After the Scache miss there is no Beache probe-—the 21464 
sends a READ MISS command to the system. The system acknowledge: 

receipt of the READ MISS by immediately asserting cack_h as:shown ifi 
Figure 4—18. 




























Figure 4-18 READ MISS—No Bcache Timing Diagram 


sys_cik_out 
emd_h<3.0> 
addr_h<394> : 
cack h _ 

fill_h 

filid_h 
data_h<127:0> 


dack_h : 
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4.8.2 READ MISS and FILL 


The 21164 issues a READ MISS command if it encounters a cache as 
described in Section 4.8.2.1. The system acknowledges receipt of the co mimand. 
Later the system asserts fill_h and asserts data<127:0> on. tise ( 

and sequence as described in Section 4.8.2.2. 


4.8.2.1 READ MISS 





















the range 
e first read, 


asserted to the RAM for a programmable number 
of 4 to 15. The tag is accessed at the same time. 









MOD request. 


Figure 4-19 shows the READ MISS Mé 
cack_h as soon as it is sent. Thigallo 















systein to defer assertion of cack_h 
he system to use cmd_h<0> for the 


Dstream data. 
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Figure 4-19 READ MISS Timing Diagram 
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4.8.2.2 FILL ‘ 
Signals fill_h, fill_id_h, and seer are used to control the refiirn of 












information to select the correct fill address. Figur 
FILL command. 


If signals fill_h and fill_id_h are asserted at the nisin 
the 21164 asserts the Beache index and pee 










system must provide the data 
complete the fill correctly. For. 







quest In the second case, if the system has a victim buffer, the 21164 
uests the new block from memory while it starts to read the victim from the 
Beache. The VICTIM command and address follows the miss request. 
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In either case, the 21164 treats a miss/victim as a single transaction. Ifthe 
assertion of addr_bus_req_h or idle_bc_h causes the BIU sequencer té 
both the READ MISS and BCACHE VICTIM transactions are restarted 












request and then restarts the READ operation and resends thé : 
VICTIM command and data, then processes the READ MISS 


4.8.3.1 READ MISS with Victim (Victim Buffer) 


When the miss is detected, if the system has a victi 
for the next sysclk, then asserts a READ MIS 


sserts caok. h, the 21164 
nc the victim address. Each 
assertion of dack_h causes the cache river to ad¥ance to the next part of the 


block. Figure 4—20 shows the timing of a R “command with a victim. 
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Figure 4-20 READ MISS with Victim (Victim Buffer) Timing D 
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4.8.3.2 READ MISS with Victim (Without Victim Buffer) 


If the system does not contain a victim buffer, the 21164 stops reading the. 
Beache as soon as the miss is detected. This occurs while the second INT? 
data is on data_h<127:0>, as shown in Figure 4—21. 


A BCACHE VICTIM command is asserted at the next sysclk 
victim address. A Bcache read operation of the victim is also® 
sysclk edge. 


When dack_h is received for the first INT16 of the vicki 
reading the next INT 16 of the victim. cack_h can be a 
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4.8.4 WRITE BLOCK and WRITE BLOCK LOCK 


The WRITE BLOCK command is used to complete writes to shared data: €o 
remove Scache victims in systems without a Bcache, and to complet, wri 
operations to noncached memory. 


The WRITE BLOCK LOCK command follows the same proto 
qualifier allows the system to be more “conservative” on interl 
operations to noncached memory space. 


The WRITE BLOCK command to cached memory regio: 
the Scache sends data to the system and also causes tf 
the Beache. 


The 21164 asserts the WRITE BLOCK commant 
and the first 16 Chey of ae at the start of 






















request instead. 
When the system takes the first ps 








f the data is valid. For write operations to 
eINT4 with the int4_ valid_ see fice asserted 


Figure 4—22 e ti ming of a WRITE BLOCK command. 
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agram 


Figure 4-22 WRITE BLOCK Timing D 
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4.8.5 SET DIRTY and LOCK 
Figure 4—23 shows the timing of a SET DIRTY and a LOCK operation. 
The 21164 uses the SET DIRTY transaction to inform a duplicate:t 


that a cached block is changing from the SHARED, DIRTY stats 
SHARED, DIRTY state. When cack_h is received from the sys 












MOD command. 
4.8.5.1 When to Use a SET DIRTY and LOCK 









for these commands. 


¢ If BC_CONTROL [EI_CMD_GE 
SET DIRTY and LOCK com 


e If BC_CONTROL [EI_CG 
SET DIRTY and: EOCK 







when BC_CON'EROL IE Mi RP2] is clear. 
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Figure 4-23 SET DIRTY and LOCK Timing Diagram 
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4.8.6 Memory Barrier (MB) 











the instruction stream. The action taken by the 21164 depends u 
of BC_CONTROL<3>,[EICMD_GRP3]. 


e §6€6dif Pe ne [EI_CMD_GRP3] is set, the 21164 drains 


acknowledging receipt for the MB command. 
If BC -~CONTROL [EI_ CMD _GRP3] i is clear, it is 





[EI_CMD_GRP3] is clear. 
4.8.6.1 When to use a MEMORY BARRIER comme n 










4.8.7 FETCH 


The 21164 passes a FETCH cont 
instruction. 


48.8 FETCH_M 
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4.9 System-Initiated Transactions 


System commands to the 21164, are driven on the emd_h<3:0> signal 
The algorithm used by the 21164 for accepting system comma 
processed in parallel by the 21164 is presented in Section 4.9 














System-initiated commands may be separated into two pré 
group of commands used by write invalidate protocol syste 
described in Section 4.9.2. The group of commands v u a:b 

systems is listed and described in Section 4.9.3. 


4.9.1 Sending Commands to the 21164 


The rules used by the Cbox BIU to process ¢ 
the 21164 are listed in Section 4.12.1. 


The algorithm used by the see to ser 


1 groups. The 
is listed and — 
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Figure 4-24 Algorithm for System Sending Commands to the 21164 


Yes 
Set count to zero. 







Is 
(CMD Not NOP) 
and 
(Count ,2) 
? 






Increment ¢ 





READ or FLUSH 

or READ DIRTY INV 

or READ DIRTY. 
? 













& Decrement count. 
CK/Bca 
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4.9.2 Write Invalidate Protocol Commands 
All 21164-based systems that use the write invalidate protocol expect hat the 


commands are defined in Table 4—10. 


Table 4-10 System-lInitiated Interface Commands (W: 





cmd_h 
Command <3:0> Description 
NOP 0000 


INVALIDATE 0010 


SET SHARED 0011 





atte. The 21164 assumes that this action is correct, | 
.because the system would have sent a READ DIRTY 
command if the dirty bit were set. 


If the block is found in the Scache, the 21164 
responds with ACK/Scache. Otherwise, if the system 
contains a Beache, the block is assumed to be in the 
Beache, and the 21164 responds with ACK/Bcache. 
If the system does not contain a Bcache, and the - 
block is not found in the Scache, the 21164 responds 
— with NOACK. 


(continued on next page) 
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Table 4—10 (Cont. ) System-initiated Interface Commands (Write invalidate 













Protocol) 
cmd_h 
Command <3:0> Description 
Read a block; set shared. The RE ) DIRTY 


READ DIRTY 0101 


block is present and dirty. If the bloe! 
or if the block is clean, and. 


READ DIRTY 0111 
AINVALIDATE 





No Bcache NOACK 
No Beache. ACK/Scache 
ACK/Bcache 
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READ DIRTY and READ DIRTY/INVALIDATE Commands 


No Beache Scache_Miss 

No Beache Scache_Hit,Not Dirty 
No Bcache Scache_Hit, Dirty 
Beache Scache_Hit, Dirty 
Beache Scache_Miss 







in a multiprocessor system. 


The 21164 responds to the READ FLU: : , SET SHARED 
and READ DIRTY/INVALIDATE coni# ; 6n addr_res_h<2> as listed in 
Table 4-12. 








L164 best-case response time to sytem commands in 
protocol ‘system. 


Response Number of sys_cik_out1_h,! Cycles 


NOACK 8 CPU cycles rounded up to next 
sys_clk_out1_h,] cycles 

ACK/Scache 12 CPU cycles rounded up to next 
sys_clk_out1_h,] cycles 

NOACK, ACK/Scache, 10 CPU cycles rounded up to next 

ACK/Bcache sys_clk_out1_h,] cycles 
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4.9.2.2 READ DIRTY and READ DIRTY/INVALIDATE & 
The READ DIRTY command is used to read modified data from the cac 





command except that the block is changed to VALID i ' 
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sys_clk_outi_h 





Figure 4-25 READ DIRTY Timing Diagram (Scache Hit) 
addr_bus_reg_h 





victim_pending_h 














data_ram_we_hi= 


addr_h<39:4> 
index_h<25:4> 
data_h<127:0> 
data_ram_oe_h 


addr_res_h<2:0> 
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4.9.2.3 INVALIDATE 








The Scache is probed and invalidated if the block is found. The Beaé 
invalidated without probing. Figure 4—26 shows the timing of an ENVALI 
transaction. F 
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Figure 4-26 INVALIDATE Tim 
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4.9.2.4 SET SHARED 








When the 21164 receives a SET SHARED command, it probes the Scach 
and changes the state of the block to SHARED if it is found. The 21164 
“assumes” that the block is in the Bcache and writes the state of thé 
SHARED, DIRTY. Figure 4—27 shows the timing of a SET SH 
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Figure 4-27 SET SHARED Timing Diagram 
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iminary 


Prel 


4.9.3 Flush-Based Cache Coherency Protocol Commands 


A flush-based design using the 21164 “assumes” the system will use the 
and FLUSH commands defined in Table 4—14 to maintain cache coheren 








Command 








a and "Beache to see if the requested block is 
1t. If the block is present, the 21164 responds 
with ACK/Scache or ACK/Bcache. If the data is 
Scache, the data is driven on the data_h bus 

e same sysclk as the ACK. If the data is in 
the Beache, a Beache read operation begins in 

the same sysclk as the ACK. If the block is not 
present in either cache, the 21164 asserts NOACK 
on addr_res_h<1:0>. 
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4.9.3.1 21164 Responses to Flush-Based Protocol Commands 


The system responds to flush-based protocol commands on addr_res e10> 
as shown in Table 4—15. 


Table 4-15 21164 Responses to Flush-Based Protocol Commands 
READ and FLUSH Commands 





Bcache Status Scache Status 

No Bcache Scache_Miss 

No Bcache Scache_Hit,Not Dirty 
No Beache Scache_Hit,Dirty 


Bcache_Miss = “ 
Beache_Hit Scache_Hit, Dirty 2 





ACK/Scache 
Bcache_ Hit, Scache_Miss/Hit, NOACK 
Not Dirty 
Beache_Hit,Dirty Scache_Misé ACK/Bcache 


The purpose of addr_resh 
store to co if a bk 


system without a duplicate tag 
in the Scache or lock register. The 


Table 4—17 presents the 21164 best-case response time to sytem commands in 
write invalidate protocol system. 
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Table 4-17 Minimum 21164 Response Time to Write Invalidate Protocal 


















Commands 
Cache Status Response 
No Beache NOACK 
No Bcache ACK/Scache 
Beache NOACK, ACK/Scache, 


ACK/Beache 





Cache Status 
No Bcache NOACK 


Response 


No Bcache ACK/Scache 


Bcache 
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cmd_h<3:0> 
addr_h<39:4> 
index_h<25:4> 





sys_cilk_outi_h 
addr_bus_reg_h 
victim_pending_h 
addr_res_h<2:0> 


Figure 4-28 FLUSH Timing Diagram (Scache Hit) 


0> 


data_h<127, 
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4.9.3.3 READ 


The READ command is used by the system to read DIRTY data from th 


21164. The tag control status does not change. Figure 4—29 shows the ti 
and tag control status of a READ transaction. 
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Timing Diagram (Scache Hit) 


Figure 4-29 READ 
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4.10 Data Bus and Command/Address Bus Contention 


4.10.1 













The data bus is composed of data_h<127:0> and data_check_h<15:0>. 
The command/address bus is composed of cmd_h<3:0>, addr_h< 
addr_cmd_par_h. 


The following sections describe situations that have contentio: 
data bus or contention for use of the command/address bus. 


volend ncares? Bus 


sys_cik_out1_h 
addr_bus_req_h 


21164 Drive 
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4.10.2 Read/Write Spacing—Data Bus Contention 


The data bus, data_h<127:0>, can be driven by the 21164, the Beac 
or the system. 












In the case of private Bcache write operations followed by px 
operations, the 21164 stops driving the data bus well in 
turning on. 


For private Bcache read operations followed by priv 
operations, the 21164 inserts a ip aad nun 





cycles between 
i¢he. drivers to 
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4.10.3 Using idle_bc_h and fill_h 
The 21164 uses the idle_ be_h and fill_ h signals to fill data into the Beattie. 



















completes any Bcache transaction it might have started while 
fill data. 


Signal fill_h is asserted a fixed number of sysclk cycles befor: 
start the fill transaction in the Bcache. 


At the end of the fill, the 21164 waits five CPU cycles 
write operation. This time should allow the system t 
in practice, this is not enough time, the ayeien May ass 
gain additional cycles. 


Calculating Time to Assert idle_bc_h 
The equations for calculating length 


read hit idle = 2 + (block size/16) + ® 
tristate_ ram turn off - 
read miss idle = ce 


write idle = 


m off times, if the system will not turn on its 
onds after the 21164 starts driving Bcache 


34 0=24 


= 4+ (64/16) * 5+0=24 
/6), (19/6), (24/6) = 4 


4-70 Preliminary Edition-September 1994 




















If the 21164 receives asserted idle_be_h at sysclk edge N, the FIL 
can be received at sysclk edge N+3. The 21164 drives index_h<25: 
the Bcache on sysclk edge N+4. 


Figure 4-31 Example of Using idle_bc_h and fill_h 


sys_clk_outt_h,! 
idle_be_h 

fill_h 

dack_h 
index_h <25:4> 
data <127:0> 52: SE j 


Minimum idle“... 
If the sytem conti 
than or ae 
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4.10.4 Using data_bus_req_h 


The signal data_bus_req_h can be used along with the idle_be_h signatto 
prevent the 21164 and the Beache from driving the data bus. In general the: 
system should not need to use this feature but it is useful if the.sys 
other devices on the data bus. 















To gain control of the data bus, the system must ensure that { 
by naire idle_be_h for the required time. It can then 


Figure 4-32 Using data_bus_req_h_ 


sys_cik_out1_h,| 
idle_bc_h 
data_bus_req_h 


21164 drive 
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4.10.5 Tristate oe 











somewhere in the middle: 


4.10.5.1 READ or WRITE to FILL 


The time required to tristate the 21164 d : 
command, or the the Bcache drivers at the 
the idle_be_h equation. 


BCACHE VICTIM to FILL 
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Figure 4-33 READ MISS Completed First—Victim Buffer 












CPU Clock Cycles 


sys_cik_outi_h 


dack_h 


index <25:4> 


data_h <127:0> 


MLO-012415 








ck edge is labeled N, then the READ MISS 
fe Hext sysclk edge, and the data_ram_oe_h will 
dge of &PU clock N+S+1, where S is the sysclk ratio. 

, it will take an extra sysclk to send the READ MISS 


the corresponding 1 risi 
command will 
deassert at th 
If the sysclk } 
command;.so 
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Figure 4-34 READ MISS Second—No Victim Buffer 







N+S N+S+1 


CPU Clock Cycles 


sys_clk_out1_h 


index <25:4> 
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4.10.5.3 System Bcache Command to FILL 


At the end of a system command that uses the Bcache, the system mu 


provide enough time for the Bcache drivers to turn off before returning any i] 
data. 


The final dack_h will be sampled by the 21164 on the rising edge 
If the corresponding rising CPU clock edge is labeled N, data: 
deassert at the rising edge of CPU clock N+5. 




















Figure 4-35 System Command to FILL Example 1 


CPU Clock Cycles 
sys_clk_outi_h 
dack_h 

index <25:4> 


data_h <127:0> 
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If the system command was a SET SHARED or an INVALIDATE cok mand, 














to complete. During the write transaction, data_ram_ oe_ 
but not tag_ram_oe_h. At the end of the write trans: 


CPU Clock Cycles 
sys_cik_out1_h 


addr_res_h <1 0> 
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4.10.5.4 FILL to Private Read or Write Operation & 
At the end of the fill, the 21164 does not begin to drive the data bus untifthe 














assert data_ram_oe_h until the fifth cycle after the sysclk that ke 2 
dack_h. ‘ 


Systems requiring more time to turn off their drivers must né 
requests and must use idle_bc_h and data_bus_req_h at the 
stop 21164 requests. 


Figure 4-37 FILL to Private Read or Write | 


N N+1 N+2 NeQN+4 Na 


CPU Clock Cycles 


sys_clk_outi_h 


index <25:4> 
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4.11 21164 Interface Restrictions 


This section lists restrictions on the use of 21164 interface features. 













4.11.1 FILL Operations after Other Transactions 


If the system has removed data from the 21164 with any. rel tons 
commands, or removed a Beache victim from the Beache, y 


assertion of dack_ h. 
4.11.3 Systems Without a Beach¢ 





to: ‘assert dack_h. If the system asserts dack_h for 
scack_h and efail_h together, and the sysclk ratio is 


“h are asserted at any time without asserting dack_h, the 
ition will be failed correctly. 


f; cack_h, — cfail_h causes the write operation to fail correctly. 
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4.12 21164/System Race Conditions 


When certain sequences of transactions occur on the interface between thi 
21164, the Bcache and the system race conditions may occur. The rules fo 
of the interface by the 21164 and the system are listed in Sections4. 

















Examples of race conditions to be avoided are described and ith trated i in 
Section 4.12.2 through Section 4.12.6. 


4.12.1 Rules for 21164 and System Use of External I jie 


1. Ifidle_be_h is not asserted aa there ar 
command buffer, then the BIU is free to. 


The assertion of idle_bc_h, ort 
than NOP to the 21164, a 


tem comand is under way. 


“A READ MISS with a BCACHE VICTIM transaction is treated as an 
itomic pair. The command order, READ MISS then BCACHE VICTIM or 
}CACHE VICTIM then READ MISS, is programmable. Either way, if the 
first command is acknowledged with cack_h, then both commands must be 
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acknowledged with cack_h and all the data acknowledged with lack_ h, 
before the 21164 responds to any other request. 


9. The cack_h acknowledgment for a WRITE BLOCK or BCACHE 














transactions, it is possible to acknowledge all but the 4 
decide to do something else. 


. For a READ MISS transaction, cack_h must be : 


the same command unless: 


a. A system request is received that 
by the oreina! 21164 request. : 





command will not be 8 
the BIV.- 
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Figure 4-38 READ MISS with Victim Example 
















sys_clk_out1_h Cycles 


cmd_h<3:0> 


addr_h<127:0> 


victim_pending_h 
addr_bus_req_h 


idle_be_h 


cack_h 


index <25: 


MLO-012420 
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4.12.3 idle_bc_h and cack_h Race Example 


In this example, idle_be_h and cack_h are asserted in the same sy otk. The 
system takes the READ MISS and BCACHE VICTIM command befor doing 
anything else. The last dack_h meets the requirement that 

before or with the last dack_h. 
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Figure 4-39 idle_bc_h and cack_h Race Example 


















sys_clk_outi_h Cycles 


cemd_h<3:0> 


addr_h<127:0> 


victim_pending_h 
addr_bus_req_h 


idle_bc_h 


cack_h 


dack_h 


index <25:4> 2 


MLO-012421 
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In this example, the 21164 has started a Bcache read operation that ‘fiiisses. 











sys_cik_out1_h Cycles 


cmd_h<3:0> 


addr_h<127:0> 
victim_pending_h 
addr_bus_req_h 


MLO-012422 
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4.12.5 READ MISS with Victim Abort Example 
In this example, the 21164 produces a READ MISS command with a vic 


21164 produces. a clean READ MISS transaction. 
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Figure 4-41 READ MISS with Victim Abort Example 


















victim_pending_h 
addr_bus_req_h 
addr_bus_req_h 


addr_res_h <2:0> 


idle_bc_h 


a fill from the system. A Bcache hit to index j take places while waiting for 
the fill. The system then returns the requested data in two bursts, asserting 
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Figure 4-42 Bcache Hit Under READ MISS Example 


addr_h <127:0> 


victim_pending_h 


addr_bus_req_ h 


fill_h 


idle_bc_h 


cack_h 


dack_h 


index <25:4> 





data_h <127:0> 
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4.13 


4.13.1 


Data Integrity, Bcache Errors, and Command/Address 
Errors 











described. 


Data ECC and Parity 
The 21164 supports INT8 error correction code Cpu. 


written into the Beache. FILL data from the. 
checked for errors. The receiving node detes 


Uncorrected data from the Beache or sy 
files. If a correctable error is detaét 
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Figure 4-43 ECC Code 


CBO 
CB1 
CB2 
CB3 


CB4 
CB5 
CB6 


CB7 . 


CB2 and CB3 are calculated for CDD parity (an odd number of 1s counting .- 
the CB). 


CBO, CB1, CB4, CB5, CB6, and CB7 are calculated for EVEN parity (an 
number of 1s counting the CB). 















0123 4567 8901 2345 6789 0123 4567 8901 2345 6789 0123 4567 8901 2345 6789 


Pa De rs Oe Oe OO PP Oe De 
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are ee ee sc Se es OP | 
x Ds rere Oe a Oe Pa ee Pe Oe Rs PP Oe PR Bo 


eM ATTY ace es. Te AD VAT ee ee es 11..17 1771 2... 11... 
snatoee>s aneset 1441 9917 2.0... 2... 9997 1194 2.0.00... 1997 1117 =... 
VATOTTA cei Se hE arts 41770°09997«7979 «1997 22 ee 


ASTACANAT, ic Bais te. poeane’s 1149 1997 2... 2... 9999 1991 1117 
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in Table 4-19. 


e 


Lower 64 bits 
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For x4 RAMs, the following bit arrangement detects nibble errors: 































CBO CBl CBS CB6 
CB2 DO D4 DS 
CB3 CB4 D7 D8 
CB7 D2 D3 D1l 
D1 D6 D10 D13 
D9 D14 D18 D21 
D12 D16 Di7 D22 
D15 D19 D20 D23 
D24 D25 D27 D30 
D26 D28 D29 D31 
D32 D34 D35 D37 
D33 D36 D38 D40 
D39 D4l D43 D46 
D42 D44 D45 D47 
D48 D50 DS51 - D53 
D49 D52 DS4 D56 
D55 D57 D59 D62 
D58 D60 D61 D63 


4.13.2 Force Correction 
Setting BC_CTL<4>, [CORR_ i 


4.13.3 


a. is used to maintain parity over tag_data_h<38:20>. 
ity error is usually not recoverable. 


Beache victim is processed according to the tag control status alone, not the 
tag control parity bit. The Cbox records the Bcache probe address and the tag 
ntrol value read from the Bcache. A tag control parity error causes a trap to 
L.code, which handles the error condition. 
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4.13.5 Address and Command Parity 


The signal line addr_cmd_par_h is used to maintain odd parity over 
addr_h<39:04> and emd_h<3:0>. 


4.13.6 Fill Error 


The signal fill_error_h is asserted by the system to notify the: 
error has occurred. 











Systems in which a fill error timeout is not expected, sue 


but has no other effect. 


4.13.7 Forcing 21164 Reset 
Assertion of cfail_h a a sys 
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4.14 Interrupts 


The 21164 has seven interrupt signals that have different uses during 
initialization and normal operation. 


Figure 4—44 shows the 21164 interrupt signals. 












Figure 4-44 Alpha 21164 Interrupt Signals 








irq_h<3:0> 
sys_mech_chk_irq_h 





pwr_fail_irq_h 
mch_hit_irq_h 





clock configuration 
Section 4.2.3:<éserib 


ble 4-20 shows which interrupts are enabled for a given interrupt priority 
vel (IPL). An interrupt is enabled if the current IPL is less than the target 
IPL of the interrupt. 
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Table 4-20 Interrupt Priority Level Effect 


Interrupt Source 





Software Interrupt Request 1 
Software Interrupt Request 2 
Software Interrupt Request 3 
Software Interrupt Request 4 
Software Interrupt Request 5 
Software Interrupt Request 6 
Software Interrupt Request 7 
Software Interrupt Request 8 
Software Interrupt Request 9 

Software Interrupt Request 10 
Software Interrupt Request 11 
Software Interrupt Request 12 
Software Interrupt Request 13 
Software Interrupt Request 14 
Software Interrupt Request 15 


Power fail interrupt! 


System machine check interr 
detected correctable ir 
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Target 
IPLio 


1 


Source 


Internal 


























On rn oa fP &D DW 


Internal 
Internal 
Internal 
Internal 
Internal 
Internal 
Internal 


Internal 

pwr_fail_irg_h 
sys_mch_chk_irg_h and 
internal 

irg_h<0> 

irg_h<1> 


irq_h<2> 


1[These interrupts are from external sources. In some cases, the system environment provides the 
OR of multiple interrupt sources at the same IPL to a particular pin. 


(continued on next page) 


Table 4-20 (Cont.) Interrupt Priority Level Effect 


Target 
IPLio 









Interrupt Source 


External interrupt 23! (I/O eine at 
IPL 23) 


Halt} 


1These eee are from external sources. In some cases, the sy: : 
logic-OR of multiple interrupt sources at the same IPL iculay: 





not sical executing PALcode. “Bek 
PAL dispatch address, the pipeline i is 
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This chapter describes the 21164 microprocessor i t 
(IPRs). It is organized as follows: . 










Internal Processor Registers 








r registers 


Instruction fetch/decode unit and bran 


Memory address translation unit ( 








Cache control and bus interface 
PAL storage registers 


Restrictions 







e), and backup cache (Beache) IPRs are 
Ss region FF FFFO 0000 to FF FFFF FFFF. 


=Scache, and Beache IPRs. Table 5-38 lists 


Note 
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Table 5-1 Ibox, Mbox, Dcache, and PALtemp IPR Encodings 
IPR Mnemonic Access Indexi¢ Ibox Slots to Pipe 


Ibox IPRs 

ISR 

ITB_TAG 
ITB_PTE 
ITB_ASN 
ITB_PTE_TEMP 
ITB_JA 

ITB_IAP 

ITB_IS 

SIRR 

ASTRR 

ASTER 
EXC_ADDR 
EXC_SUM 
EXC_MASK 
PAL_ BASE 

PS 

IPL 

INTID 
IFAULT_VA_FORM 


100 El 
101 El 
102 
103 


gggg****232** 
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(continued on next page) 


Table 5-1 (Cont.) Ibox, Mbox, Deache, and PALtemp IPR Encodings: 
IPR Mnemonic Access Indexi¢ Ibox Slots to Pipe 

PALtemp IPRs 
PALtemp0 R/W 140 
PALtemp1 R/W 141 
PALtemp2 R/W 142 
PALtemp3 R/W 143 
PALtemp4 R/W 144 
PALtemp5 R/W 145 
PALtemp6 R/W 146 
PALtemp7 
PALtemp8 
PALtemp9 - 
PALtemp10 
PALtemp11 
PALtemp12 
PALtemp13 
PALtemp14 
PALtemp15 
PALtemp16 
PALtemp17 
PALtemp18 
PALtemp19 
PALtemp29 






























(continued on next page) 
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Table 5—1 (Cont.) Ibox, Mbox, Dcache, and PALtemp IPR Encodings 
IPR Mnemonic 
DTB_TAG 
DTB_PTE 
DTB_PTE_TEMP 
MM.STAT 
VA 

VA_FORM 
MVPTBR 
DTBIAP 

DTBIA 

DTBIS 
ALT_MODE 

CC 

CC_CTL 

MCSR 
DC_FLUSH 
DC_PERR_STAT 
DC_TEST_CTL 
DC_TEST_TAG  . 
DC_TEST_TAG T 
DC_MODE 
MAF MODE 















Indexi¢ Ibox Slots to Pipe 
202 EO 
203 EO 
204 = BO 
205 EO 
206 EO 
207 


gg <a <sseeeserera i 
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5.1 Instruction Fetch/Decode Unit and Branch Unit (Ibex) 
IPRs 


The Ibox internal processor registers (IPRs) are described in Seéti 
through Section 5.1.27. 













5.1.1 Istream Translation Buffer Tag Register (ITB_TAG 
ITB_TAG is a write-only register written by hardware. 






fields of an ITB entry are updated simultaneously: ‘by 
ITB _PTE register. This write operation causes. the co 
he fre: 


format. 


Figure 5-1 Istream Translation | 


32 


VA<42:13> 


LJ-03473-T10 
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5.1.2 Instruction Translation Buffer Page Table Entry (ITB_PTE) 
Register 
ITB_PTE is a read/write register. 


Write Format 


A write operation to this register writes both the PTE and TA‘ 
ITB location determined by a not-last-used replacement algori 
and PTE fields are updated simultaneously to ensure the.j 















fields of an 






write format. 


Figure 5-2 instruction Translatio 


PFN<39:13> 


LJ-03474-TlO 





ister returns the PTE pointed to by the NLU pointer to the ITB PTE_ 

‘MP register and increments the NLU pointer. If the HW_MFPR ITB_PTE 
struction falls in the shadow of a trapping instruction, the NLU pointer may 
be itiéremented multiple times. A zero value is returned to the integer register 
file. A’second read of the ITB_PTE_TEMP register returns the PTE to the 
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general purpose integer register file (IRF). Figure 5-3 shows the ITB, PTE 
register read format. 








Figure 5-3 Instruction Translation Buffer Page Table Entry (ITB_P 
| Format 


29 28 22 2120 19 18 17 14 13 12 


ERE 
SRE 
URE 
GHD<2:0> 


LJ-03475-TI0 


Preliminary Edition-September 1994 5-7 


5.1.3 Instruction Translation Buffer Address Space Number (ITB ASN) 
Register 


ITB_ASN is a read/write register that contains the address space ; 
(ASN) of the current process. Figure 5—4 shows the ITB_ASN r 







Figure 5-4 Instruction Translation Buffer Address Space Nuit 


(ITB_ASN) 
Register By 












31 . 1110 


RAZ/IGN ANCE: 
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5.1.4 Instruction Translation Buffer Page Table Entry Temporar 
(ITB_PTE_TEMP) Register 


ITB_PTE_TEMP is a read-only holding register for ITB_PTE r 
of the ITB_PTE register returns data to this register. A ate 












Extent Type 
GHD <29> RO 
GHD | <30> RO 
<31> 








Register | 
ITB_IAP is a write-only ref is 
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5.1.7 Instruction Translation Buffer IS (ITB_IS) Register 


ITB_IS is a write-only register. Writing a virtual address to this registe 
invalidates the ITB entry that meets either of the following criteri 


e An ITB entry whose virtual address (VA) field matches ITB g 
whose ASN field matches ITB_ASN<10:04>. 


¢ An ITB entry whose VA field matches ITB_IS<42:13> and w: 
is set. 


Figure 5-5 shows the ITB_IS register format. 





Figure 5-5 Instruction Transiation Buffer IS (ITB..IS) Register . 





VA<42:13> 






32 


VA<42:13> 
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IFAULT_VA_FORM is a read-only register containing the formatted f 
virtual address on an ITBMISS/IACCVIO (except on IACCVIO. 


Figure 5-6 Formatted Faulting Virtual Address (IFAULT_V. 
Mode=0) 


VA<42:13> 


VA<42:13> 


LJ-03479-TI0 


Figure 5—7 shows the IFA FORM ’‘régister format in NT mode. 


Figure 5-7 Formatted Fauiting Vint 


Address (IFAULT_VA_FORM) Register (NT_ 
Mode=1) . 


3130 29 


VA<31:13> 


VPTB<63:30> 
VPTB<63:30> 
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5.1.9 Virtual Page Table Base Register (IVPTBR) 
IVPTBR is a read/write register. Bits <32:30> are UNDEFINED on a re 











mode. 


Figure 5-8 Virtual Page Table Base Register (IVPTBR) (NT 


3130 29 


MAQ602 


({NT_Mode=1) 
3130 29 


VPTB<63:30> 
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5.1.10 Icache Parity Error Status (ICPERR_STAT) Register 


ICPERR_STAT is a read/write register. The Icache parity error stat 
be cleared by writing a 1 to the appropriate bits. Figure 5-10 and Tab i 3 . 
describe the ICPERR_STAT register format. 











Figure 5-10 Icache Parity Error Status (ICPERR_STAT) F 


13121110 = 


DPE 
TPE 
TMR 


LJ-03482-T10 


Table 5-3 Icache Parity Er 
Extent 


q Data parity error 
TPE dl «Tag parity error 


5.1.11 Icache F| . RgC_ FLUSH CTL) Register 
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5.1.12 Exception Address (EXC_ADDR) Register 







(PC) to this register. In case of precise exceptions, 
value of the instruction that caused the exceptio 
exceptions/interrupts, this is the PC value of the: es 








Bit <00> of this register is used to indie 
instruction, the mode of the system. 


LJ-03483-TI0 
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5.1.13 Exception Summary (EXC_SUM) Register 













register format. 
Figure 5-12 Exception Summary (EXC_SUM) Regist 


17 16 15 141312 1110 09: 


pares ABBR REE 


Swc 
INV 
DZE 
FOV 
UNF 
INE 
lov 
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Table 5-4 


Name 


SWC 


vooaiiee 


Indicates software completion possible. This bit 
is set after a floating-point instruction containing 
the /S modifier completes with an arithmetic trap 
and if all previous floating-point instructions that 
trapped since the last HW_MTPR EXC_SUM 
instruction also contained the /S modifier. 


The SWC bit is cleared whenever a floating-point 
instruction without the /S modifier completes 
with an arithmetic trap. The bit remains cleared 
regardless of additional arithmetic traps until the 
register is written by an HW_MTPR instruction. 
The bit is always cleared upon any HW_MTPR 
write operation to the EXC_SUM register. 


(continued on next page) 
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Table 5-4 (Cont.) Exception Summary Register Fields 


Name Extent Type 
INV <1l1> WA 
DZE <12> WA 
FOV <13> WA 
UNF <14> WA 
INE <15> WA 
IOV <16> WA 


5-16 Preliminary Edition—September 1994 












Description 


Indicates invalid operation. 
Indicates divide by zero. 

Indicates floating-point overflow: 
Indicates floating-point u derfiow. 






Indicates floating ine 
Indicates floating-poirit:e on tant <Fbox) 
convert to integer overflow nteger arithmetic 





overfiow. 


5.1.14 Exception Mask (EXC_MASK) Register 


EXC_MASK is a read/write register that records the destinations0 0 
instructions that have caused an arithmetic trap between EXC.. MASK ¥ ite 
k.at 









representing FO-F31 and 10-131. A write operation to EX 
EXC_MASK register. Figure 5-13 shows the EXC_MASK:: 





Figure 5-13 Exception Mask (EXC_MASK) Registe: 






131130129... 







F31F30F29... 
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5.1.15 PAL Base Address (PAL_BASE) Register 


PAL_BASE is a read/write register containing the base address for PALcad: 
The register is cleared by hardware on reset. Figure 5-14 shows the PAL: 
BASE register format. ee 










Figure 5-14 PAL Base Address (PAL_BASE) Register 


_ BASE<39:14> 


LJ-03486-TIO 
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5.1.16 Processor Status (PS) Register 


PS is a read/write register containing the current mode bits of the 
architecturally defined processor status as described in the Alpha Ar 
Reference Manual. Figure 5-15 shows the PS register form 


Figure 5-15 Processor Status (PS) Register 


LJ-03487-TIO 
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5.1.17 lIbox Control and Status Register (ICSR) 


ICSR is a read/write register containing Ibox-related control and status 
information. Figure 5—16 and Table 5—5 describe ICSR format. 









Figure 5-16 Ibox Control and Status Register (ICSR) 


31 30 29 28 27 26 25 24 23 20 19 : 10 09 08 07 


ise ae a al 


RAZ/IGN 


CRDE 
SLE 
FMS 
FBT 
FBD 
MBO 
ISTA 
TST 
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Table 5-5 Ibox Control and Status Register Fields 


Name Extent 


PME<1:0> <09:08> 


IMSK<3:0> <23:20> 


TMM <24> 
TMD <25> 
FPE <26> 


HWE <27> 


SPE<1:0> <29:28> 


Type 
RW,0 


RW,0 


RW,0 


RW.0 


RW,0 


RW,0 






















Description 
Performance Counter ar te 
either PME<I1> or PME<0: } 


enabled according to the set 
CTL fields. 


: SPE<1> i is set, it enables superpage mapping 


Stream virtual address VA<39:13> directly 
to Plysical address PA<39:13> assuming 
VA<42: 41> = 10. Virtual address bit VA<40> 
ignored in this translation. Access is allowed 





“only in kernel mode. 


If SPE<0> is set (NT mode), it enables 
superpage mapping of Istream virtual addresses 
VA<42:30> = 1FFEi¢ directly to physical address 
PA<39:30> = 016. VA<30:13> is mapped directly 
to PA<30:13>. Access is allowed only in kernel 
mode. 


If set, enables PAL shadow registers. 
If set, enables correctable error interrupts. 
If set, enables serial line interrupts. 


If set, forces miss on Icache references. MBZ in 
normal operation. 


(continued on next page) 
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Table 5-5 (Cont.) Ibox Control and Status Register Fields 


Name Extent 
FBT <35> 
FBD <36> 


Reserved <37> 
ISTA <38> 


TST <39> 
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Type 
RW,0 


RW0 


RW,1 
RO 


RW,0 












Description 


If set, forces bad Icache tag parity. 
normal operation. se 


If set, forces bad Icache data p 
normal operation. 


Reserved to Digital. Mu: 


Writing a 1 to this bit ass 
h<I> signal. = 


5.1.18 Interrupt Priority Level (IPL) Register 


IPL is a read/write register containing the value of the interrupt pri 
level (IPL). Whenever hardware detects an interrupt whose target IP 
is greater than the value in IPL<04:00>, an interrupt is tak: : 
shows the IPL register format. 






vel 





Figure 5—17 Interrupt Priority Level (IPL) Register 
31 


: IPL<4:0> 


LJ-03489-TI0 
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5.1.19 Interrupt ID (INTID) Register 


INTID is a read-only register that is written by hardware with the targ 
interrupt priority level of the highest priority pending interrupt. The har 
recognizes an interrupt if the IPL being read is greater than the.IPE 
IPL<04:00>. 


Interrupt service routines may use the value of this register 1 
cause of the interrupt. PALcode, for the interrupt service, 
IPL level in INTID is greater than the IPL level specified 










se must chi 6k the interrupt 
‘terrupt has occurred. 


INTID<4:0> 


LJ-03490-TI0 
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5.1.20 Asynchronous System Trap Request Register (ASTRR) ; 
ASTRR is a read/write register containing bits to request ee 






set and the current processor mode given in the PS<04:03> Shi 


or higher than the mode associated with the AST request: 
the ASTRR format. 


LJ-03491-TIO 
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5.1.21 Asynchronous System Trap Enable Register (ASTER) 


ASTER is a read/write register containing bits to enable corresponding ; 
asynchronous system trap (AST) interrupt requests. Figure 5-20 gs 
ASTER format. Z 


LJ-03492-TIO 
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5.1.22 Software Interrupt Request Register (SIRR) 


SIRR is a read/write register used to control software interrupt requ its. 
A software request for a particular IPL may be requested by setting t 
appropriate bit in SIRR<15:01>. Figure 5-21 and Table 5-6 déseribex 
format. 













Figure 5-21 Software Interrupt Request Register (SIRR) 


31 19 18 00 


> | RAZ/IGN 


LJ-03493-Tl0 


Name Extent 


SIRR<15:1> <18i(4> uest software interrupts. 
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5.1.23 Hardware Interrupt Clear (HWINT_CLR) Register 


HWINT_CLR is a write-only register used to clear edge-sensitive hardware. 


interrupt requests. Figure 5-22 and Table 5—7 describe the HWINT_CL 
register format. "i 


























Figure 5-22 Hardware Interrupt Clear (HWINT_CLR) Registé 


31 30 29 28 27 26 


-PCOC 
PCIC 
PC2C 


34 33 32 


SLC 
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Table 5-7 Hardware:interr 
Extent Typ: 


Name 





PCOC 
PC1C Clears performance counter 1 interrupt requests 
PC2C “Clears performance counter 2 interrupt requests 
CRDC 


Clears correctable read data interrupt requests 
Clears serial line interrupt requests 
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5.1.24 Interrupt Summary Register (ISR) 


ISR is a read-only register containing information about all evap’ ha dware, 










and Table 5-8 describe the ISR format. 


Figure 5-23 Interrupt Summary Register (ISR) 


31 30 29 28 27 26 25 24 23 22 21 2019 18 04 03 


TITS L Pee 
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Table 5-8 Interrupt Summary Register Fields 


ASTRR<3:0> <03: 00> 
and 


ASTER<3:0> 
SISR<15:1> 


ATR 
120 
{21 
122 


123 
PCO 


PC1 


PC2 


PFL 


MCK 


CRD 
SLI 
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Extent 


<18:04> 
<19> 
<20> 
<21> 
<22> 


<23> 
<27> 


<28> 


<29> 


<30> 


<31> 


RO 


Type 
RO 


RO,0 
RO 
RO 
RO 
RO 


RO 
RO 


RO 



























Description 


Enabled AST requests 3 through: js 
IPL 2 


Software interrupt requests - 
corresponding to IPL 15 throug’ 


xternal hardware interrupt—system machine 
neck (IPL 31) 


Camedabis ECC errors (IPL 31) 
Serial line interrupt 
External hardware interrupt—halt 


5.1.25 Serial Line Transmit (SL_XMIT) Register 


SL_XMIT is a write-only register used to transmit bit-serial data out: 
microprocessor chip under the control of a software timing loo 
of the TMT bit is transmitted off es on the srom_ cllx_ h = ‘ 
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Table 5-9 Serial! Line t 


mit Re ster Fields 


Name 
TMT 


Description 


® Serial line transmit data 
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5.1.26 Serial Line Receive (SL_RCV) Register ‘ 
SL_RCV is a read-only register used to receive bit-serial data under the ce rol 





connected to the srom_ data_ h signal. A serial line interrupt iste 









whenever a transition is detected on the srom_data_h signal a 
in the ICSR is set. During normal operations (not in test mod 
data_h signal is overloaded and serves both the serial line recepti 
Icache serial ROM (SROM) interface. Figure 5-25 and Tabil#.5—10 
SL_RCV register format. 


Figure 5-25 Serial Line Receive (SL_RCV) Register 


RCV 
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Description 


Serial line receive data 
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5.1.27 Performance Counter (PMCTR) Register 


PMCTR is a read/write register that controls the three on-chip performa: 
counters. Figure 5-26 and Table 5-11 describe the PMCTR format. “= 








CTR<13:0> 


SELO 


MA-0601 
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Table 5-11 Performance Counter Register Fields 













name Extent Type Description 

CTRO<15:0> <63:48> RW A 16-bit counter of events selecte (3 
enabled by CTL0<1:0>. : 

CTR1<15:0>  <47:32> RW A 16-bit counter. 

SELO <31> RW 


Ku <30> RW 


CTR2<13:0> <29:16> RW 
CTLO<1:0> <15:14> RW,0 


CTL1<1:0> <13:12> RW,0 



















2 countér control: 
unter disable,interrupt disable 

01 citer enable, interrupt disable 

10 counter enable, interrupt at count 16384 
-11 counter enable, interrupt at count 256 


reall PALmode—disables all counters in 
. PALmode (refer to Table 5—13). 


Kill kernel, executive, supervisor mode— 
disables all counters in kernel, executive, 
and supervisor modes (refer to Table 5—13). 
Ku=1, Kp=1, and Kk=1 enables counters in 
executive and supervisor modes only. 


Counter1 Select—refer to Table 5-12. 
Counter2 Select—refer to Table 5-12. 


CTL2<1:0> <11:10> 
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Table 5-12 shows the PMCTR counter select options. 













Table 5-12 PMCTR Counter Select Options 





Counter0 Counter1 Counter2 
SELO<0> SEL1<3:0> 
0:Cycles 0x0: non-issue cycles 

Valid instruction in S3 but none 

issued. 


0x1: split-issue cycles 
Some, but not all, instructions at 
S3 issued. 


0x2: pipe-dry cycles 
No valid instruction at S3. 


1:Instructions 0x2: PC-mispredicts 


0x3: BR-mispredicts 


IntOps issued 0x4: Icache/RFB misses 


OxA: FPOps issued 0x5: ITB misses 

OxB: loads issued 0x6: Dcache LD misses 
OxC: stores issued 0x7: DTB misses 

0xD: Icache issued 0x8: LDs merged in MAF 


(continued on next page) 
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Table 5-12 (Cont.) PMCTR Counter Select Options 


Counter0 Counter1 Counter2 
SELO<0> SEL1<3:0> SEL2<3:0> 


























OxE: Deache accesses 


Measurement Mode Desired 







Program 0 
PAL only 1 
OS only (kernel, 0 
supervisor) : 
User only 1 1 
All except PAL 1 0 
0 0 
User + PAL (not 0 1 
executive, and su 
eS ‘ i 


kill kernel only. The combination Ku=1, Kp=1, and Kk=1 is used to 
he. executive and supervisor modes only. 





Note 


‘Both the user and the operating system can make PAL subroutine 
calls that put the machine in PALmode. The “OS only,” “user only,” 

id “executive and supervisor only” modes do not measure the events 
during the PAL subroutine calls made by the OS or user. The “OS + 
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mode measures the events during the PAL calls made by the usé 
whereas “user + PAL” mode measures the events during the PAL 
made by the OS. 
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5.2 Memory Address Translation Unit (Mbox) IPRs 


The Mbox internal processor registers (IPRs) are described in Section 5. 
through Section 5.2.23. 









Note 


Traps are factored into Mbox IPR write operations unless 
otherwise. 


5.2.1 Dstream Translation Buffer Address Space Numbe 3 
Register 
DTB_ASN is a write-only arth that must. G 


form at. Ss 


Figure 5-27 Dstream Translation Buffer A ress Sp ce Number (DTB_ASN) 
Register ss 





ASN<6:0> 


LJ-03499-TIO 
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5.2.2 Dstream Translation Buffer Current Mode (DTB_CM) Register 


DTB_CM is a write-only register that must be written with an exact : 
of the Ibox processor status (PS) register CM field. These bits i 
current mode of the machine as described in the Alpha Archité eCtiL 
Manual. Figure 5-28 shows the DTB_CM register format. ; 













e (DT 





Figure 5-28 Dstream Translation Buffer Current Mode 


LJ-03500-TI0 
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5.2.3 Dstream Translation Buffer Tag (DTB_TAG) Register 
DTB_TAG is a write-only register that writes the DTB tag and the conten ; 





register when the DTB_TAG register is written. 


The entry to be written is chosen at the time of the DTB_TA 
by a not-last-used replacement algorithm implemented in 


VA<42:13> 





VA<42:13> 


LJ-03501-Ti0 
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5.2.4 Dstream Translation Buffer Page Table Entry (DTB_PTE).Register 


DTB_PTE is a read/write register representing the 64-entry DTB page . 
table entries (PTEs). The entry to be written is chosen by a not. iclast-u id 





DTB. PTE! TEMP register. A 
> (IRF) on a DTB_PTE read 
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Figure 5-30 Dstream Translation Buffer Page Table Entry (DTB_PTE) Register—-Write 
Format 





16 15 14 13 12 11 10 09 08 07 06 05 04 03 02 01 00. : 


LJ-03502-TI0 
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5.2.5 Dstream Translation Buffer Page Table Entry Temporary: 
(DTB_PTE_TEMP) Register 


DTB_PTE_TEMP is a read-only holding register used for DTB_ 
operations of the DTB_PTE require two instructions to return, tt 







the DTB_PTE_TEMP register format. 
Figure 5-31 Dstream Translation Buffer Page Tabie Ent 


Register 


PFN<39:13> 


PFN<39:13> 


PFN<39:13> 


LJ-03503-TI0 
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5.2.6 Dstream Memory Management Fault Status (MM_STAT) Register 


MM_STAT is a read-only register that stores information on Dstream f: 
and Deache parity errors. The VA, VA_LFORM, and MM_STAT regi 
locked against further updates until software reads the VA regis! 
MM_STAT bits are only modified by hardware when the register: 
and a memory management error, DTB miss, or Deache pari 
The MM_STAT register is not unlocked or cleared on reset. Fig 
Table 5—14 describe the MM_STAT register format. 












Figure 5-32 Dstream Memory Management Fault Stat. 


17 16 1110 . 





LJ-03504-TI0 


Dstre 
Extent 










Table 5-14 


'y Management Fault Status Register Fields 
Description 








Set if reference that caused error was a write 
operation. 


Set if reference caused an access violation. 
Includes bad virtual address. 


Set if reference was a read operation and the 
PTE FOR bit was set. 


Set if reference was a write operation and the 
PTE FOW bit was set. 


Set if reference resulted in a DTB miss. 
Set if reference had a bad virtual address. 


(continued on next page) 
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Table 5—14 (Cont.) Dstream Memory Management Fault Status Register 








Fields 
Name Extent Type Description 
RA <10:06> RO RA field of the faulting inst 
OPCODE <16:11> RO Opcode field of the faulting} 
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5.2.7 Faulting Virtual Address (VA) Register 


VA is a read-only register. When Dstream faults, DTB misses, or Deac 
parity errors occur the effective virtual address associated with the fault, * 
miss, or error is latched in the VA register. The VA, VA_FORM,-4' 
STAT registers are locked against further updates until softwa 
register. The VA register is not unlocked on reset. Figure 5-33 
register format. 
















Figure 5-33 Faulting Virtual Address (VA) Register 


31 


LJ-03505-TlO 


5-46 Preliminary Edition-September 1994 


5.2.8 Formatted Virtual neal? (VA_ none) ned ott 







page table base (VA and MVPTBR registers). This is done a a per 
enhancement to the Dstream TBmiss PAL flow. 







VPTB<63:30> 


LJ-03507-TI0 


ddréss (VA_FORM) Register (NT_Mode=0) 


VPTB<63:33> 


VA<42:13> 


LJ-03506-TI0 
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Table 5—15 describes the VA_FORM register fields. 













Table 5-15 Formatted Virtual Address Register Fields 


Name Extent §_ Type Description 
NT_Mode-0 
VPTB <63:33> RO 


VA<42:13> <32:03> RO 







NT_Mode=1 


VPTB <63:30> RO Virtual page . s stored in 
MVPTBR .: 


inal faulting virtual address 


VA<31:13> <21:03> RO Subjet of the 


5-48 Preliminary Edition—-September 1994 


5.2.9 Mbox Virtual Page Table Base Register (MVPTBR) 


MVPTBR is a write-only register containing the virtual address of thé be 
the page table structure. It is stored in the Mbox to be used in. 






31 30 29 


VPTB<63:30> 


LJ-03508-T10 
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5.2.10 Dcache Parity Error Status (DC_PERR_STAT) Register | 
DC_PERR_STAT is a read/write eee that locks and stores Dcache par : 







the register is locked against further 1 updates epee for the SE 
software writes a 1 to clear the LOCK bit. 


unlock and clear the bit. The SEO bit is not s 
detected on both pipes within the same cycle. : 
eal dee Deache parity error status bits i if 


LJ-03509-TI0 
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Table 5-16 Dcache Parity Error Status Register Fields 
Name Extent Type Description 


SEO <00> W1C Set if second Dcache parity erre: 
cycle after the register was = ed. 





Preliminary Edition-September 1994 5-51 


5.2.11 


5.2.12 











Dstream Translation Buffer Invalidate All Process (DTBIAP) 
Register 
DTBIAP is a write-only register. Any write operation to this regi 


invalidates all data translation buffer (DTB) entries in which t 
space match (ASM) bit is equal to zero. 


Dstream Translation Buffer Invalidate All (DTBIA) Regi ed 


DTBIA is a write- only register. Any write operation to thi: 
all 64 DTB entries, and resets the DTB not-last-used (NLU 
initial state. 


ster 
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5.2.13 Dstream Translation Buffer invalidate Single (DTBIS) Register 


DTBIS is a write-only register. Writing a virtual address to this regk : 
invalidates the DTB entry that meets either of the following cri 


¢ <A DTB entry whose VA field matches DTBIS<42:13> ant 
matches DTB_ASN<63:57>. 


¢ A DTB entry whose VA field matches DTBIS<42: 13> 2 an 
set. 










Figure 5-38 shows the DTBIS register format. 


Figure 5-38 Dstream Translation Buffer In 


VA<42:13> 
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5.2.14 Mbox Control Register (MCSR) 


MCSR is a read/write register that controls features and records status 
the Mbox. This register is cleared on chip reset but not on timeout, reset. ° 
Figure 5-39 and Table 5-17 describe the MCSR format. 


Figure 5-39 Mbox Control Register (MCSR) 


31 






| BIG_ENDIAN 
-SP<1:0> 

MBZ 

E BIG_ENDIAN 
MBZ 


LJ-03511-TIO 
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Table 5-17 
Name 
M_BIG_ 
ENDIAN 


SP<1:0> 


Reserved 


E_BIG_ 
ENDIAN 


Reserved . 








Mbox Control! Register Fields 


Extent Type Description 

<00> RW,0 Mbox Big Endian mode enable. Wher 
physical address is inverted for all:i6 
references. 

<02:01> RW,0 areas mode enables. 










<03> RW,0 
<04> RW,0 





the shifter operation. 


<05> = Srved to Digital. Must be zero (MBZ). 
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5.2.15 Dcache Mode (DC_MODE) Register 


DC_MODE is a read/write register that controls diagnostic and test modes n 
the Dcache. This register is cleared on chip reset but not on time 
Figure 5-40 and Table 5—18 describe the DC_MODE register fornii 










Note 
The following bit settings are required for normal operation: 


DC_ENA = 1 

DC_FHIT = 0 
DC_BAD_PARITY = 0 
DC_PERR_DISABLE = 0 


20403 02 01 00 


DC_FHIT 
DC_BAD_PARITY 
DC_PERR_DISABLE 


LJ-03512-T10 


5-56 Preliminary Edition-September 1994 


Table 5-18 Dcache Mode Register Fields 


Name 
DC_ENA 


DC_FHIT 


DC_BAD_ 
PARITY 


DC_PERR_ 
DISABLE 


Extent Type 
<00> RW,0 


<01> RW,0 


<02> 

















Description 


Software Deache enable. The: €: 

enables the Dcache unless:the Deache: 
been disabled in hardware DC_DOAi 
(The Deache is enabled if:BG. 

DOA=0). When clear. me 





ais has the effect of putting bad data 
£ 2 Dcache on integer stores that 


Wicdting point store instructions should not be 


sgued when this bit is set because it may result 


det, buffer. Must be zero (MBZ) in normal 
operation. 


When set, the DC_PERR_DISABLE bit disables 
Deache parity error reporting. When clear, 

this bit enables all Deache tag and data parity 
errors. Parity error reporting is enabled during 
all other Dcache test modes unless this bit is 
explicitly set. Must be zero (MBZ) in normal 
operation. 
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5.2.16 Miss Address File Mode (MAF_MODE) Register 











describe the MAF_MODE register format. 


Note 
The following bit settings are required for normal 


DREAD_NOMERGE = 0 
WB_FLUSH_ALWAYS = 0 
WB_NOMERGE = 0 

MAF _ARB_ DISABLE = 0 
WB_CNT_DISABLE = 0 





DREAD_NOMERGE 
WB_FLUSH_ALWAYS 


WB_CNT DISABLE 

MAF_ARB DISABLE 
DREAD_PENDING (Read-only) 
WB_PENDING (Read-only) 


LJ-03513-T10 
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Table 5-19 Miss Address File Mode Register Fields 


Name 


DREAD_ 
NOMERGE 


WB_FLUSH_ 
ALWAYS 


WB_ 
NOMERGE 


IO_NMERGE 


WB_CNT_ 
DISABLE 


MAF_ARB_ 
DISABLE 


DREAD _ 
PENDING 



















Extent Type Description 
<00> RW,0 
entry. Subsequent merging to that 
(even if DREAD _ NOMERG ise] 
<01> RW,0 
<02> RW,0 
<03> RW,0 







<04> ¥ disables the pies WB counter in 


bei ot The top entry of the WB arbitrates 







<05> 







in the MAF arbiter. WB_Reissue, Replay, Iref and MB 
uests are not blocked from arbitrating for the Scache. 
This bit is cleared on both timeout and chip reset. Must 
be zero (MBZ) in normal operation. 


Indicates the status of the MAF DREAD file. When set, 
there are one or more outstanding DREAD requests 

in the MAF file. When clear, there are no outstanding 
DREAD requests. 


This bit indicates the status of the MAF WB file. When 
set, there are one or more outstanding WB requests in 
the MAF file. When clear, there are no outstanding WB 
requests. 
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5.2.17 


5.2.18 


Deache Flush (DC_FLUSH) Register 
DC_FLUSH is a write-only register. A write operation to this register cleg 
all the valid bits in both banks of the Dcache. 
Alternate Mode (ALT_MODE) Register 


ALT_MODE is a write-only register that specifies the alternaté gro 
used by some HW_LD and HW_ST instructions. Figure 542 and: 
describe the ALT_MODE register format. 













Figure 5-42 Alternate Mode (ALT_MODE) Register | 


LJ-03514-TI0 
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5.2.19 Sycle Sounter (CC) Register 











instruction that enables the counter reads. 
initial count. 








The CC register is disabled on chip reset. 
format. 


Figure 5-43 Cycle Counter (¢ C 


CC, OFFSET 


LJ-03515-TI0 
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5.2.20 Cycle Counter Control (CC_CTL) Register 






COUNT<31:04> 


CcC_ENA 


LJ-03516-TI0 









oe G vole count. This value is loaded into 
CC<31:04>. 


Cycle Counter enable. When set, this bit 
enables the CC register to begin incrementing 
3 cycles later. An RPCC issued 4 cycles after 
CC_CTL<32> is written “sees” the initial count 
incremented by 1. 


COUNT<31:04> < 


CC_ENA 
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5.2.21 Dcache Test Tag Control (DC_TEST_CTL) Register 


DC_TEST_CTL is a read/write register used exclusively for testing 
diagnostics. An address written to this register is used to index. sie the 
_ array when reading or writing to the DC_TEST_TAG registe ire. 
and Table 5—22 describe the DC_TEST_CTL register form 
- describes how this register is used. 














Figure 5-45 Dcache Test Tag Control (DC_TEST_CTL) Reg 


INBEX<12:3> 


BANKO 
BANK1 
IGN/RAZ 


LJ-03517-T1O 


‘Deache BankO enable. When set, reads from DC_ 
TEST_TAG return the tag from Dcache bank0, 
writes to DC_TEST_TAG write to Dceache bankO. 
When clear, reads from DC_TEST_TAG return 
the tag from Dcache bank1. 


Deache Bank1 enable. When set, writes to DC_ 
TEST_TAG write to Deache bank1. This bit has 
no effect on reads. 


Deache tag index. This field is used on reads 
from and writes to the DC_TEST_TAG register to 
index into the Deache tag array. 
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5.2.22 Dcache Test Tag (DC_TEST_TAG) Register 


DC_TEST_TAG is a read/write register used exclusively for testing and 
diagnostics. When DC_TEST_TAG is read, the value in the DC_TEST_CT 
register is used to index into the Dcache. The value in the tag, tax rity. 
and data parity bits for that index are read out of the Deache aiid 
the DC_TEST_TAG_TEMP register. A zero value is returned: : 
register file (IRF). If BANKO is set, the read operation is from Deg 
Otherwise, the read poke is from Deache bank1. 






TAGS: 13> 


TAG_PARITY 
OWO_VALID 
OW1_VALID 


TAG<38:13> 


LJ-03518-TI0 
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Table 5-23 Dcache Test Tag Register Fields 





Name Extent Type Description 

TAG_ <02> WO Tag parity. This bit refers to, thi 

PARITY parity bit that covers tag bits: 
(valid bits not covered). ; 

OWO_VALID- <ll> wo Octaword valid bit 0. This bit 


Deache valid bit for 1] w-ord 
within a Deache 32-3 








OW1_VALID <i12> wo 


TAG<38:13> <38:13> WO 
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5.2.23 Decache Test Tag Temporary (DC_TEST_TAG_TEMP) Registe 


DC_TEST_TAG TEMP is a read-only register used exclusively for testin 
diagnostics. 


Reading the Dcache tag array requires a two-step read process: : 










Figure 5-47 and Table 5-24 describe the DC_T 
format. 


Figure 5-47 Dcache Test Tag Temporary (DG 


13 12 11 





31 





TAG_PARITY 
DATA_PARO<0> 
DATA_PARO<1> 
DATA_PAR1<0> 
DATA_PAR1<1> 
OW0_VALID 
OW1_VALID 


TAG<38:13> 





LJ-03519-TIO 
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Table 5-24 Dcache Test Tag Temporary Register Fields 
Name Extent Type Description 


TAG_PARITY <02> RO 


















Tag parity. This bit refers to the Desshie 
DATA_PARO<0> <03> RO 
DATA_PARO<1> <04> RO 
DATA_PAR1<0> <05> RO 
DATA_PAR1<1>  <06> RO 

~ OW0_VALID <1l> RO 


OW1_VALID <12> it 1. This bit refers to the Deache valid 


high-order octaword within a Dcache 32-byte 





TAG <38:13> <38:13>- 


Preliminary Edition-September 1994 5-67 


5.3 External Interface Control (Cbox) IPRs 
























Table 5—25 lists specific IPRs for controlling Scache, Bcache, system 
configuration, and eae error information. These IPRs cannot 


Section 5.3.9. 






Table 5-25 Cbox internal Processor Register:D 


Register Address 

SC_CTL FF FFFO 00A8 ior. 
SC_STAT FF FFFO 00E8 

SC_ADDR FF FFFO 0188 


BC_CONTROL € nel Beahaercean interface 
and Beache testing. 

BC_CONFIG Contains Beache configuration 
parameters. 

BC_TAG ADDR Contains tag and control bits for 


FILLs from Bcache. 
Logs Bcache/system-related errors. 


Contains the address for 
Bcache/system-related errors. 


ELSTAT 
EI_ADDR 


Contains fill syndrome or parity 
bits for FILLs from Bcache or main 
memory. 
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5.3.1 Scache Control (SC_CTL) Register 


SC_CTL is a read/write register that controls Scache activity. Figure*6=4 
Table 5-26 describe the SC_CTL register format. The bits in this regis iy are 
initialized to the value indicated in Table 5—26 on reset, but.si6€: 

reset. 









Figure 5-48 Scache Control (SC_CTL) Register 


1918 1615 131211 08 07 





SC_SET_EN<2:0> 
Reserved 


LJ-03520-TI0 
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Table 5-26 Scache Control Register Fields 


Field Extent Type 
SC_FHIT <00> RW,0 
SC_FLUSH <01> RW,0 
SC_TAG_ <07:02> RW,0 
STAT<5:0> 
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, Status<5:0> 
" $C_TAG_ 









Description 
When set, this bit forces i : 
Noncacheable i ea : 


hit in the Scache and will bé% 
In this mode, only « 








is ueePonly j in the SC_FHIT mode 
combination of tag status and 

unity bits'inithe Scache. The parity bit can 
to write bad tag parity. The correct 















e eile tae bits must be zero for normal 
tion: 


Scache Tag 
Description 


Tag parity, valid, 


STAT <5:2> shared, dirty; 
bits 7, 6, 5, and 4 
respectively 
SC_TAG _ Octaword modified 
STAT<1:0> bits 


(continued on next page) 









Table 5-26 (Cont.) Scache Control Register Fields 
Field Extent Type Description 


SC_FB_DP<3:0> <11:08> RW,0 Force bad parity—This fiel 
bad data parity for these 
within the octaword 


“B_DP<0> corresponds to 
LW4. This bit field must be zero 
Sau 











SC_BLK_SIZE <12> 


_ of the selected block size. At power-up time, 
this bit is set and the default block size 
is 64 bytes. When clear, the block size is 
32 bytes. This bit must be set to the desired 
value to reflect the correct Scache/Beache 
block size before the 21164 does the first 
cacheable read or write transaction from 
Beache or system. 


This field is used to enable the Scache sets. 
Only one or all three sets may be enabled 
at a time. Enabling any combination of two 
sets at a time results in UNPREDICTABLE 
behavior. 


“<18:16> RW,0 Reserved to Digital. Must be zero (MBZ). 
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5.3.2 Scache Status (SC_ STAT) Register 


SC_STAT is a read-only register. It is not cleared or unlocked by feck.” s 
PALcode read of this register unlocks SC_ADDR and SC_STAT and clears 
SC_STAT. 


If an Scache tag or data parity error is detected during an Se; 
the SC_STAT register is locked against further updates from * 
transactions. Figure 5-49 and Table 5—27 describe the SC TAT: 
format. 











Figure 5—49 Scache Status (SC_STAT) Register 





SC_TPERR<2:0> 
SC_DPERR<7:0> 
SC_CMD<4:0> 
SC_SCND_ERR 


LJ-03521-TI0 
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Table 5-27 Scache Status Register Fields 


Field 
SC_TPERR<2:0> 


SC_DPERR<7:0> 


SC_CMD<4:0> 


SC_SCND_ERR 








Extent Type Description 
<02:00> RO When set, these bits at 


<10:03> RO 













Ww. : ine 
‘had the data 


ng ap had a data parity 
HIT: ‘8C CTL<00>) is set, 


<15:11> 


TREAD, or WRITE command from the — 

‘box, Scache victim command, or the 
system command being serviced. Refer to 
Table 5—28 for field encoding. 


When set, this bit indicates that an Scache 
transaction resulted in a parity error 
while the SC_TPERR or SC_DPERR 

bit was already set from the earlier 
transaction. This bit is not set for two 
errors in different octawords of the same 
transaction. 
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Table 5-28 SC_CMD Field Descriptions 


SC CMD Source<15:14> 
1x 


00 
01 
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SC CMD Encoding<13:11> Description 
ao = 


101 
100 


001 
001 
001 
011 





ache DWRITE 


5.3.3 Scache Address (SC_ADDR) Register 


SC_ADDR is a read-only register. It is not cleared or unlocked by resé 
address is loaded into this register every time the Scache is accessed 
of the error bits in the SC_STAT register is not set. If an Scat . 
















This register is ‘nlocked whenever SC_STAT is read. 


For Scache read transactions, address bits <39:04> ar 


éd for storing the 
: Scache, there are unique valid, - 


(16 bytes). There is a single tag a 
subblocks. In force hit mode, of 


of 5-cycle spacing is required 
bread of the SC_ADDR or SC_, STAT. 
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Figure 5-SO Scache Address (SC_ADDR) Register 






Normal Mode 
31 







SC_ADDR<38:04> 










Force Hit Mode 
31 151413 






TAG<38:15> : Sted 





o| TAG<38-15 
RAZ 
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Table 5—29 Scache Address Register Fields 
Name Extent Type Description 







Normal Mode 


SC_ADDR<38:04> <38:04> RO | Scache address. 

















Force Hit Mode 





TP <04> Scache tag parity b 
vo <05> i 
SO <06> 

DO <07> 

V1 <08> 

Sl <09> 

D1 <10> 

Mo <12,11> 

M1 <14,13> 


TAG<38:15> <38:15> 
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5.3.4 Bcache Control (BC_CONTROL) Register 


BC_CONTROL is a write-only register. It is used to enable and control 
external Bcache. Figure 5—51 and Table 5-30 describe the BC_CO] ROL 
register format. The bits in this register are initialized to the val : 

in Table 7—2 on reset, but not on timeout reset. 


Figure 5-51 Bcache Control (BC_CONTROL) Register 


31. «29 28 27 26 25 24 191817 16 15 1413 12 08 07 06 05 04 o3'g2:01 : 


I esd | ee | 
P - = a 





BC_ENABLED 

wc ALLOC_CYC 

EL CMD_ GRP2 

El CMD. GRP3 
CORR_FILL_DAT 
VTM_FIRST 

EL ECC. OR_PARITY 
BC_FHIT 

BC _ TAG_STAT<4:0> 
BC | BAD . DAT 

El DIS ERR 

PIPE_| LATCH 
BC_WAVE<1 10> 
PM_MUX_SEL<5:0> 


MBZ 
FLUSH_SC_VTM 
MBZ 
DIS_SYS_PAR 
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Table 5-30 Beache Control Register Fields 


Field 
BC_ENABLED 


ALLOC_CYC 


EI_CMD_GRP2, 


EI_CMD_GRP3 


CORR_FILL_DAT 











Extent Type Description 
<00> WO,0 


<01> WO,0 


<02> WwO,0 


<03> 
Vinterface command pins to be acknowledged 
-by the system interface. When clear, the MB 


,mand is not driven to the command pins. 


Correct fill data from Bcache or main memory, in 
“ECC mode. When set, fill data from Beache or main 
memory first goes through error correction logic 
before being driven to the Scache or Deache. If the 
error is correctable, it is transparent to the system. 


When clear, fill data from Beache or main memory is 
driven directly to the Deache before an ECC error is 
detected. If the error is correctable, corrected data is 
returned again, Deache is invalidated, and an error 
trap is taken. 


(continued on next page) 
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Table 5-30 (Cont.) 


Field Extent Type 
VTM_FIRST <05> WO,1 
EI ECC_OR_ <06> WO,1 
PARITY 

BC_FHIT <07> WO,0 
BC_TAG_ 

STAT <4:0> 
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Bcache Control Register Fields 



























Description 


This bit is set for systems without a 
On a Beache miss, the 21164 first d 
victimized block’s address on the s¥s 
followed by the read miss address 
This bit is cleared for systems with 
On a Beache miss with victixt th: 
out the read miss folowe: 
command. 


When set, the 21164 genera 
ECC on the data check. ins. 
generates or expects Ma 
check pins. 







and index are the address received by 
> Beache tag RAMs are written with the 





&ro during normal operation. 


.. This*hit, field is used only in BC_FHIT=1 mode to 


ite any combination of tag status and parity bits 
th e Beache. The parity bit can be used to write 
‘d tag parity. These bits are UNDEFINED on 
reset. This bit field must be zero during normal 


“operation. The field encoding is as follows: 


(continued on next page) 











Table 5-30 (Cont.) Bcache Control Register Fields 
Field Extent Type Description 


Bcache Tag Status . 
Bit Descript L 


BC_TAG_STAT<4> 
BC_TAG_STAT<3> 2:Pa 


BC_TAG STAT<2> 
BC_TAG_STAT : 


BC_BAD_DAT <14:13> WO,0 







mn set, this bit causes the 21164 to ignore 

'y ECC (parity) error on fill data received from 
the Bcache or main memory; or Beache tag or 
‘control parity error. It also ignores a system 
command/address parity error. No machine check is 
taken when this bit is set. 


When set, this bit causes the 21164 to pipe the 
system control pins (addr_bus_req_h, cack_h, and 
dack_h) for one system clock. 


(continued on next page) 


KI_DIS_ERR 





PIPE_LATCH 
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Table 5-30 (Cont.) Bcache Control Register Fields 


Field Extent Type 
BC_WAVE<1:0> <18:17> WO,0 






Description 


ie : : : The bits in this field determine oul ump 








PM MUX. 
SEL<5:0> 


Reserved 
FLUSH_SC_VTM 
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private read transactions of the B 
pipelining cannot be used in 32-by 


To enable wave pipelining, 
should be set to the latengy: 
read. BC_ CONTROL<18;, > sh 





Flush Scache victim buffer. For systems without 


“a Beache, when this bit is clear, the 21164 flushes 
the on-chip victim buffer if it has to write-back any 


entry from the victim buffer. When this bit is set, 
the 21164 writes only one entry back from the victim 
buffer as needed. 


For systems with a Bcache, this bit must always be 
clear. At power-up this bit is initialized to a value of 
0. 


Reserved—MBZ. 


(continued on next page) 


Table 5-30 (Cont.) Bcache Control Register Fields 
Field Extent Type Description 




















DIS_SYS_PAR <28> WO,0 When set, the 21164 does not che 
system command/address bus 
parity will still be generated.., 


Table 5-31 describes the PM_MUX_SEL fields. 


Table 5-31 PM_MUX_SEL Register Fields 
PM_MUX_SEL<21:19> Counter 1 


0x0 . Scache accesses 
0x1 3 
0x2 
0x3 
0x4 
0x5 
0x6 









ache write misses 
Séache shared write operations 
Scache write operations 
Beache misses 

System invalidate operations 


System read requests 
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5.3.5 Bcache Configuration (BC_CONFIG) Register 


BC_CONFIG is a write-only register used to configure the size and speed: 6f 
the external Bcache array. The bits in this register are initialized to the va 
indicated in Table 5-32 on reset, but not on timeout reset. Figure” 
Table 5-32 describe the BC_CONFIG register format. 













Figure 5-52 Bcache Configuration (BC_CONFIG) Register 


29 28 201918 161514 1211 08 07 04 03 0 
a LER ere 


BC RD_SPD<3:0> 
So BE _WR_SPD<3 :0> 
= BC_RD_WR_SPC<2:0> 


MBZ 
FILL_WE_OFFSET<2:0> 
MBZ 


MLO-012926 
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Table 5-32 Bcache Configuration Register Fields 
Field Extent Type Description 


BC_SIZE<2:0> <02:00> WO,1 The bits in this field 
| the size of the Bcache:At pov 

field is initialized t I 

a 1M-byte Bcache. “f} 

follows: 













field encoding 





BC_ : 
SIZE<2:0>* 


dd Bcache size 


Reserved 
BC_RD_SPD<3:0> . 


= Must be zero (MBZ). 


The bits in this field are used to indicate 
to the BIU the read access time of the 
Beache, measured in CPU cycles, from the 
start of a read transaction until data is 
valid at the input pins. The Beache read 
speed must be within 4 to 10 CPU cycles. 
At power-up, this field is initialized to a 
value of four CPU cycles. 


For systems without a Bceache, the read 
speed must be equal to the sysclk to CPU 
clock ratio. 


The Bcache read and write speeds 
must be within three cycles of each 
other (absolute value = (BC_RD_SPD — 
BC.WR.SPD) < 4). 


(continued on next page) 
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Table 5-32 (Cont.) Bcache Configuration Register Fields 


Field Extent Type 
BC_WR_SPD<3:0> <11:08> WO,4 
BC_RD_WR_ <14:12> WO,7 
SPC<2:0> 
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Description 


The bits in this field are used: 
to the BIU the write time.if 
measured in CPU cycles,. 
speed must be within 4 
At power-up, this field i 
value of four CPU 


peaches write transactions, in addition to 
the. number of CPU cycles specified by 
this field. The maximum value (BC_RD_ 
WR_SPC+3) should not be greater than 
the Beache READ speed when Bcache is 
enabled. 


At power-up, this field is initialized to a 
read/write spacing of seven CPU cycles. 


Must be zero (MBZ). 








(continued on next page) 


Table 5-32 (Cont.) Bcache Configuration Register Fields 












Field Extent Type Description 
FILL_WE_ <18:16> . WO,1 Bcache write-enable pulse fset, 
OFFSET<2:0> the sys_clk_outn_x eiige, forth; 


transactions from the:system. This'field 
does not affect privé 
to Beache. It is used @ i 


sclk ratio is at this field 
cS ry than 3. At power-up, 
this eld i is initialized to a write offset 













Reserved <19> WoO,0 


BC_WE_CTL<8:0> <28:20> 3 
used to'control the timing of the write- 
Hable during a write or FILL transaction. 
Ifthe bit is set, the write pulse is asserted. 
If the bit is clear, the write pulse is not 
asserted. Each bit corresponds to a CPU 

* cycle. 





For private Beache write and shared-write 
transactions, this field is used to assert the 
write pulse without any offset. 

For FILLs to the Bcache, the FILL_WE_ 
OFFSET<18:16> field determines the 
number of CPU cycles to wait before 


asserting the write pulse as programmed 
in this field. 


At power-up, all bits in this field are 
cleared. 


<63:29> WO Ignored. 
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5.3.6 Bcache Tag Address (BC_TAG_ADDR) Register 


BC_TAG_ADDR is a read-only register. Unless locked, the BC_TAG. ADI 
register is loaded with the results of every Beache tag read. When. 


TAG<38:20> 


TAG_P 
BC. TAG<38:20> 


39 38 32 


TAG<38:20> 


BC_TAG<38:20> 
LJ-03526-TIO 
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Tabie 5-33 Bcache Tag Address Register Fields 


Field 
HIT 


TAGCTL_P 


TAGCTL_D 
TAGCTL_S 
TAGCTL_V 
TAG_P 
BC_TAG<38:20> 


Extent 
<12> 


<13> 


<14> 
<15> 
<16> 
<17> 
<38:20> 


Type 
RO 


RO 


RO 
RO 
RO 
RO 
RO 









Description 


If set, Beache access res’ 
the Bcache. ° 


Value of the parity: 
status bits. 


Value of the B eh 
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5.3.7 External Interface Status (EI_STAT) Register é 
ELSTAT is a ent register. Any PALcode read access of this ches 








register is not unlocked or cleared by reset. 













Fill data from Bcache or main memory could have correcta 
uncorrectable (u) errors in ECC mode. In oe mode, fill data p 


FILL. _ SYN, and EL STAT is as follows: 


1. Read EJ_ADDR, BC_TAG, and FILL SYN i in: 
clear any aimee 


Loading and locking rules for exter! 
Table 5-34. 


uncorrectable error’ 
error follow. 
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Table 5-34 Loading and Locking Rules for External Interface Registers 


Load Lock 
Register Register 





























Uncorrectable Second Hard 
Error Error 


Correctable 
Error 


0 0 Not possible No No 
1 | 0 Not possible Yes 
0 1 0 Yes 
1} 1 0 Yes 
0 1 1 No 
1! 1 1 No 


Software must reexecute the IPR read sequence. On ao sey 
all the related IPRs are unlocked, and EI_STAT is cl 


3130 29 28 27 24 23 


00 


CHIP_!ID<3:0> 
BC_TPERR 
BC_TC_PERR 
El_ES 
COR_ECC_ERR 





36 35 34 33 32 


UNC_ECC ERR 
El_PAR_ERR 
FIL_IRD 
SEO_HRD_ERR 


LJ-03524-TI0 
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Table 5-35 El STAT Register Fields 




















Field Extent Type Description 

CHIP_ID<3:0> <27:24> RO Read as “2.” Future update revisions to 
return new unique values. 

BC_TPERR <28> RO Indicates that a Bceache read transac 


bad Pe in the tag address RAM. 
BC_TC_PERR <29> RO 


ELES «30> =: RO 


COR_ECC_ERR <31> RO 


UNC_ECC_ERR <32> RO 


EI_PAR_ERR <33>255 


FIL_IRD 











the error occurred during a D-ref FILL. 


This bit is not defined for a Bcache tag error 
(BC_TPERR) or a Beache tag control parity error 
(BC_TC_ERR). 


Second external interface hard error. This bit indicates 
that a FILL from Beache or main memory, or a system 

address/command received by the CPU has a hard error 
while one of the hard error bits in the EI_STAT register 
is already set. 
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5.3.8 External Interface Address (El_ADDR) Register 
El _ADDR is a read-only register that contains the physical address 





E! ADDR<39:4> 


LJ-03525-T10 
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5.3.9 Fill Syndrome (FILL_SYN) Register 










FILL_SYN is a 16-bit read-only register. It is loaded but not locked on 
correctable ECC error, so that another correctable error does not r i 
loaded and locked if an uncorrectable ECC error or parity error . 


FILL_SYN register is unlocked when the EI_ STAT register ig: 
register is not unlocked by reset. 


If the 21164 is in ECC mode and an ECC error is reco, 
transaction, the syndrome bits associated with the pag 


indicate the bytes within the lower 
FILL_SYN<15:08> is set to indicaté@:th 
quadword. Figure 5-56 shows the:FILL_S 






register format. 
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Figure 5-56 Fill Syndrome (FILL_SYN) Register 
























31 16 15 


-03527-T10 





Table 5-36 lists the syndromes associated with.correctable sinigle-bit errors. 


Data Bit Syndrome,¢ 





00 CE 01 
01 CB 02 
02 D3 04 
03 D5 08 
04 D6 10 
05 : 20 
06 40 
07 07 80 
08 
09 
10 
i 


(continued on next page) 
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Table 5—36 (Cont.) Syndromes for Single-Bit Errors 
Data Bit Syndromei. Check Bit Syndrome, 






























18 18 

19 ib 

20 16 

21 19 

22 1A 
23 1C 
24 | E3 
25 E5 
26 E6 
27 EQ 
28 EA 
29 EC 
30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 


41 
42 


(continued on next page) 
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Table 5-36 (Cont.) Syndromes for Single-Bit Errors 
Data Bit Syndrome. Check Bit Syndrome. 























47 B5 
48 8F 
49 8A 
50 92 
51 94 
52 97 
53 98 
54 9B 
55 9D 
56 62 
57 64 ° 
58 67 
59 68 
60 6B 
61 6D 
62 

63 
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5.4 PAL Storage Registers 


The 21164 Ebox register file has eight extra registers that are called th 
PALshadow registers. The PALshadow registers overlay R8 througe 14 : 








consider R8 through R14 and R25 as local scratch. PALshado ‘regi 
not be written in the last two cycles of a PALcode flow. The né 
CPU is ICSR<SDE> = ON. PALcode disables SDE for the unali 






PALtemp read operation to availability i is one cycl 
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5.5 Restrictions 


The following sections list all known register access restrictions. 


5.5.1 Cbox IPR PAL Restrictions | 
Table 5-37 describes the Chox IPR PAL restrictions. 
























Table 5-37 Cbox IPR PAL Restrictions 


Condition pescrey. 


Store to SC_CTL, BC_CONTROL, BC_ ust be followed 
CONFIG except if no bit is changed other urrent cacheable 
than BC_CONTROL<ALLOC_CYC>, 

BC_CONTROL<PM_MUX_SEL>, or BC_ 
CONTROL<DBG_MUX_SEL>. 


Store to BC_CONTROL that only #: 
changes bits BC_CONTROL<ALLOC_ 
CYC>, BC_CONTROL<PM_MUX_SEL>, 
or BC_CONTROL<DBG_MUX_SEL>. 


Load from SC_STAT. 
Load from EI_STAT. 


Unlock SC_ADDR and SC_STAT 
Uislocks EL_ADDR, El STAT, FILL. SYN, 


and:BC_TAG ADDR. 
No LDx_L or STx_C. 


o store instructions. 


Any Cbhox IPR address. 
Any undefined C ae = 





No STx_C to cacheable space. 


Must be followed by MB, read operation 
of SC_STAT, then MB prior to subsequent 
store. 


Must be followed by MB, read operation 
of EI_STAT, then MB prior to subsequent 
store. 


BC_CONTROL<01> (ALLOC_CYCLE) 
must be clear. 
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5.5.2 PAL Restrictions—Instruction Definitions 


Mbox instructions are: LDx, LDQ_U, LDx_L, HW_LD, STx, STQ_U, STx 
HW_ST, and FETCHx. 


Virtual Mbox instructions are: LDx, LDQ_U, LDx_L, HW_LD (¥ 
STQ_U, STx_C, HW_ST (virtual), and FETCH«x. 


Load instructions are: LDx, LDQ_U, LDx_L, and HW_LD. 
Store instructions are: STx, STQ_U, STx_C, and HW_S 
Table 5-38 lists PALcode restrictions. 












Table 5-38 PAL Restrictions Table 
Y if checked 














The following in cycle 0: by Pvc! 

CALL_PAL entry Y 
Y 

PALshadow write instruction Y 

HW_LD, lock bit set 

HW_LD, VPTE bit set 

Any load instruction Y 
Y 
Y 

Any store instruction Y 
Y 
Y 
Y 
Y 


TPR DTBIS not aborted in 0,1. 


Only one HW_REI_STALL in an aligned block of four 
instructions. 


(continued on next page) 
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Table 5-38 (Cont.) PAL Restrictions Table - 

































The following in cycle 0: Restrictions (Note: Numbers refer to cycle number): 
HW_MTPR any undefined IPR Illegal in any cycle. 
number 


ARITH trap entry No HW_MFPR EXC_SUM or EXC_MASK in c 
Machine check trap entry No a file read or write access in 0,1,2,3,4,5,6, 


HW_MTPR any Ibox IPR 
(including PALtemp registers) 


HW_MTPR ASTRR, ASTER 


HW_MTPR SIRR 
HW_MTPR EXC_ADDR 
HW_MTPR IC_FLUSH_CTL 
HW_MTPR ICSR: HWE No HW_REI i 
HW_MTPR ICSR: FPE 


rR HR 


instructions. 


~~ 


then no‘HW_REL STALL in 0,1. 
HW_REI in 0,1,2,3,4. 


HW_MTPR ICSR: SPE, FMS 


rire 


HW_MTPR ICSR: SPE 
HW_MTPR ICSR: SDE 


HW_MTPR ITB_ASN 


Kd 


HW_MTPR ITB_PTE 


HW_MTPR ITB_IAP I 

ITB_IA . 

HW_MTPR ITB.. EW_REISTALL must be in the same [stream octaword. 
No HW_MFPR IFAULT_VA_FORM in 0,1,2. 


No CALL_PAL in 0,1,2,3,4,5,6,7. 
No HW_REI in 0,1,2,3,4,5,6. 


No HW_REI in 0,1,2. 
No private CALL_PAL in 0,1,2,3. 


KKK OK 





(continued on next page) 
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Table 5-38 (Cont.) PAL Restrictions Table 


























The following in cycle 0: Restrictions (Note: Numbers refer to cycle number): 


HW_MTPR CC, CC_CTL No RPCC in 0,1,2. 
No HW_REI in 0,1. 
HW_MTPR DC_FLUSH No Mbox instructions in 1,2. 
No outstanding fills in 0. 
No HW_REI in 0,1. , 
HW_MTPR DC_MODE No Mbox instructions in 1,2,3,4. 


No HW_MFPR DC_MODE in 1,2. 
No outstanding fills in 0. 

No HW_REI in 0,1,2,3. 

No HW_REI_STALL in 0,1. 


HW_MTPR DC_PERR_STAT —__ No load or store instructions in 
No HW_MFPR DC _PERR _ STA’ 








mM KK 


HW_MTPR DC_TEST_CTL 


HW_MTPR DC_TEST_TAG 





Y 
HW_MTPR DTB_ASN Y 
Y 
HW_MTPR DTB_CM, ALT_ Y 
‘MODE Y 
HW_MTPR DTB_PTE Y 
Y 
HW_MTPR DTB_TAG Y 
Y 
Y 
Y 
Y 
HW_MTPR DTBIAP, DTBIA: Y 
Y 
Y 
fiw _MEPR DTB_PTE in 1. Y 
No Mbox instructions in 1,2,3. Y 
No WMB in 1,2,3. Y 
No HW_MFPR MAF MODE in 1,2. Y 
No HW_RETI in 0,1,2. Y 
ication checker 
(continued on next page) 
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Table 5-38 (Cont.) PAL Restrictions Table 












The following in cycle 0: Restrictions (Note: Numbers refer to cycle number): 


HW_MTPR MCSR No virtual Mbox instructions in 0,1,2,3,4. 
No HW_MFPR MCSR in 1,2. 
No HW_MFPR VA_FORM in 1,2,3. 
No HW_REI in 0,1,2,3. 
No HW_REI STALL in 0,1. 


HW_MTPR MVPTBR No HW_MFPR VA_FORM in 1,2. 
HW_MFPR ITB_PTE No HW_MFPR ITB_PTE_TEMP in 1 
HW_MFPR DC_TEST_TAG No outstanding DC fills in 0. 

o HW_! 


HW_MFPR DTB_PTE 


1PALcode verification checker 
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This chapter describes the 21164 privileged archite 
(PALcode). The chapter is organized as follows: 


e PALcode description 

° PALmode environment 

¢ Invoking PALcode 

e PALcode entry points 

¢ Required PALcode functio 


e Alpha 21164 implementat 


hitecturally reserved opcodes 
instructions ee 


is common across Al: 


PALcode diff 


System hardware exceptions (MCHK, ARITH) 


Memory-management exceptions 


CALL_PAL instructions 
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PALcode has characteristics that make it appear to be a combination of: 
microcode, ROM BIOS, and system service routines, though the analogy't6:..any 
of these other items is not exact. PALcode exists for several major reasons? 







There are functions that must run “aioinieally 9 
instructions that may need complete access. 6 


In each of these cases, E 
routines are nothia 








pe buffer fill routines, Istream mapping ae cannot 
fabled. Dstream mapping is still enabled. : 


‘Fhe prograsit has privileged access to all the computer hardware. Most of 
£he functions handled by PALcode are privileged and need control of the 
west levels of the system. 








a Interrupts are disabled. If a long sequence of instructions need to be 
uted atomically, interrupts cannot be allowed. 
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21164 the special PALmode-only instructions perform the fe 


Read or write internal processor registers (HW. 


PALcode is invoked , entry points, under certain well-defined 
conditions. hese.entry: ints provide access to a series of callable routines, 


z rdware segs i this and behaves as if the PC were still longword aligned 
for the purposes of Istream fetch and execute. On HW_REI, the new state of 
PALmode is copied from EXC_ADDR<00>. 
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When an event occurs that needs to invoke PALcode, the 21164 first drains 
the pipeline. The current PC is loaded into the EXC_ADDR IPR, and th 
appropriate PALcode routine is dispatched. These operations occur unde 
direct control of the chip hardware, and the machine is now in PAE 

When the HW_REI instruction is executed at the end of the PAEé 
the hardware executes a jump to the address contained in nest 4 















PALcode: 


¢ When the 21164 is reset, it enters F i 
PALcode. The system will remair 










initial RESET PALcode e 
initialization is performe 


a rted. For additional PAL scratch, the Ibox has a register bank of 24 
‘Ltemp registers, which are accessible via HW_MTPR and HW_MFPR 
ructions. 
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6.4 PALcode Entry Points 


PALcode is invoked at specific entry points. The 21164 has two typ 
PALcode entry points: CALL_PAL and i 


6.4.1 CALL_PAL Entry — 











at the following offsets: 
e slate CALL_PAL instructions start at o 


PC<12> = CALL_PAL function field<7> 
PC<11:06> = CALL_PAL function field<5:0> 
“=PC<05:01> = 0 
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¢ PC<00> = 1 (PALmode) 
The minimum number of cycles for a CALL_PAL execution is 4: 


Number of 

Cycles Description 

1 Minimum TRAPB for empty pipe. Typically this wil 
1 Issue the CALL_PAL instruction. 








The minimum length of a PAL flow. Howevex, 


6.4.2 PALcode Trap Entry Points 
Chip-specific trap entry points start ) EXCA 







(Prioritization among the Detagi 
suppressed when there is a si 
interrupt is reversed:if ther‘ 











Descelion 
Reset 


Istream access violation or sign check error 
on PC 


Interrupt: hardware, software, and AST 
Istream TBMISS 
Dstream TBMISS 


Dstream TBMISS during virtual page table 
entry (PTE) fetch 


Dstream unaligned reference 
(continued on next page) 
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Table 6-1 (Cont.) PALcode Trap Entry Points 


Entry Name 
DFAULT 
MCHK 
OPCDEC 


ARITH 
FEN 


DRAINA 








Offseti¢ Description 

0380 Dstream fault or sign che <<: 
address 

0400 Uncorrected hardware® 

0480 Illegal opcode 








0500 
0580 


00.0002 
00.0000 





white 


ifiplementation of the Architecturally 
! Opcodes instructions 














isés the Alpha AXP instruction set for most of its operations. 

: ble 6-3 lists the opcodes reserved by the Alpha AXP architecture for 
mplementation-specific use. These opcodes are privileged and are only 
. available in PALmode. 
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Table 6-3 Opcodes Reserved for PALcode 
Architecture 

Opcode Mnemonic Function 
Performs Dstreait 


1B PAL1B 
instructions. 





21164 
Mnemonic 


HW_LD 














executed in kernel mode. Any so 
use extreme care to obey all res 







Note 
fig. i ired for accessing the hardware-specific 
‘registers. These constraints are described in 
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Figure 6-1 HW_LD Instruction Format 













26 25 21 20 16 15 14 13 12 111009 


Pr as ea ele 


Table 6—4 HW_LD Format Description 
Field Value Description 
OPCODE 1Bis The OPCODE eld 
RA 
RB 
PHYS 


WRTCK 


QUAD 


VPTE | a virtual PTE fetch. Used by trap logic to distinguish 


gle TBMISS from double TBMISS. Access checks are 
performed in kernel mode. 


Load lock version of HW_LD. PAL must slot to E0 pipe. 
Holds a 10-bit signed byte displacement. 
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6.6.2 HW_ST Instruction 


PALcode uses the HW_ST instruction to access memory outside of the reaim of 
normal Alpha AXP memory management and to do special forms of Dstre 
store instructions. hee 6—2 and Table 6—5 describe the > format, 
















LJ-03470-TI0 


Field 


OPCODE 
RA 
RB 
PHYS ‘ ca ; 
iddress for the HW_ST is physical. Translation 
ory-management access checks are inhibited. 
ALT 


ry-management checks use Mbox IPR ALT_MODE for 
checks. 


sngth is longword. 
Length is quadword. 


Store_conditional version of HW_ST. In this case, RA is written 
with the value of LOCK_FLAG. 


Holds a 10-bit signed byte displacement. 
HW_ST<13,11> must be zero. 
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6.6.3 HW_REl aructon 

















value of PALmode after the HW_REI instruction. 
. The Ibox uses the return prediction stack to speed the exes 
There are two different types of HW_REI: 


changes (such as ITB write instructions 
that PALcode can have only one veuen q 
instructions. 


Figure 6-3 and Table 6—6 desc h 
instruction. The Ibox logic will slot 1 
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e OPCODE field contains 1E,6. 
gister numbers, should be R31 to avoid unnecessary stalls. 


‘Normal version. 
‘Stall version. 


HW_REI<13:00> Must be zero. 


W_MFPR and HW_MTPR Instructions 


The HW_MFPR and HW_MTPR instructions are used to access internal state 
..from the Ibox, Mbox, and Dcache. The HW_MFPR from Ibox IPRs has a 
atency of one cycle (HW_MFPR in cycle n results in data available to the 
using instruction in cycle n+1). HW_MFPR from Mbox and Dcache IPRs has 
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a latency of two cycles. Ibox hardware slots each type of MXPR to the 
Ebox pipe (refer to Table 5—1). 


Figure 6—4 and Table 6-7 describe the format and fields of the HW..MFP 
HW_MTPR instructions. 
















Figure 6-4 HW_MFPR and HW_MTPR Instruction Format 


Field 


OPCODE 1916 
1D46 
RA/RB 


Index 
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Initialization and Con guratio 






Sysclk ratio and delay 
e Built-in Self-test (BiSt) 










Serial terminal port 






¢ Cache initialization 


External interface ini ; 







“the proper operating point. After power has reached 
operating point, signal de_ ok_ h must be asserted. Then, signal 







state. If signal de_ok_h is not asserted, signal sys_reset_l is forced 


asserted internally. After sys_reset_l is deasserted, the 21164 begins the 
ollowing sequence of operations: 


Ieache built-in self-test (BiSt) 


‘An optional automatic Icache initialization, using an external serial ROM 
(SROM) interface 
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3. Dispatch to the reset PALcode trap entry point (physical location 0) 


a. If step 2 initialized the Icache using the SROM interface, the cac 
should contain code that appears to be at location 0, that i ig 
should be initialized such that it hits on the dispatch. T: 
code in the Icache should configure the 21164’s IPRs as 
before causing any off-chip read or write commands. ¥! 
21164 to be configured to match the external system imp 


b. If step 2 did not initialize the Icache, the Icacheé 
reset. The reset PALcode trap dispatch misse 
(also flushed by reset) and produces an off-chip } 


‘Reset State 


NA (input). 
Clock output. 
NA (input). 


(continued on next page) 
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Table 7-1 (Cont.) Alpha 21164 Signal Pin Reset State 
Signal Reset State 
Clocks. 








osc_clk_in_ h,l Must be clocking. . 
ref_clk_in_h NA (input). 


sys_clk_out1_h,l Clock output. 
sys_clk_out2_h,l Clock output. 
sys_reset_l NA (input). 
Bcache 


data_h<127:0> 

data_check_h<15:0> 
data_ram_oe_h Deasserted. 
Deasserted, 





data_ram_we_h 
index_h<25:4> 
tag_ctl_par_h 
tag_data_h<38:20> 
tag_data_par_h 
tag_dirty_h 
tag_ram_oe_] 
tag_ram_we_h 
tag_shared_h 
tag_valid t 


(continued on next page) 
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Table 7-1 (Cont.) Alpha 21164 Signal Pin Reset State 
Signal Reset State 























System Interface 


addr_h<39:4> 
most recent sysclk edge. If driven, the val 


addr_bus_reg_h NA (input). 

addr_cmd_par_h Driven or tristated depending upofy 
most recent sysclk edge. If drive: 

addr_res_h<2:0> NOP. 

cack _h | Must be deasserted. 

cfail_h Must be deasserted. 

cemd_h<3:0> Driven or tristated dej 


" dack_h 
data_bus_req_h 
fill_h 
fill_error_h 
fill_id_h 
fill_ nocheck_h 
idle_bc_h 
int4_valid_h<3:0>: 
scache_set_h<1:0> 
shared_h 
system_lock_flag: 
victim_pendi i 





Interrupts 


Sysclk divisor ratio input. 
Sysclk delay input. 
Sysclk delay input. 
Sysclk delay input. 


(continued on next page) 
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Table 7-1 (Cont.) Alpha 21164 Signal Pin Reset State 
Signal Reset State 

Test Modes 
port_mode_h<1:0> NA (input). 
srom_clk_h Deasserted. 
srom_data_h NA (input). 


srom_oe_l Deasserted. 





srom_present_l NA (input). 
tck_h NA (input). 
tdi_h NA (input). 
tdo_h | NA Ginpat). 
temp_sense_h A: 
test_status_h<1:0> 
tms_h 
trst_l 


aie 4 


Miscellaneous 


perf_mon_h 


spare_io 


al sy xeset_l must remain asserted while signal dc_ok_h is deasserted, 
d for some period of time after de_ok_h assertion. It should remain asserted 
r at least 400 internal CPU cycles in length. Then, signal sys_reset_] may 
be deasserted. Signal sys_reset_l deassertion need not be synchronous with 
xespect to sysclk. 
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delay are determined by input pins while signal sys_reset_l remains ass : 
Refer to Section 4.2.2 and Section 4.2.3 for ratio and delay values... 


7.1.1 Power-Up Requirements 


The 21164 chip uses a 3.3-V dc power supply. This 3.3-V power: 
stable before any input or bidirectional pin rises above 4 


7.1.2 Pin ole with a ok_h Not Asserted 


pull-down transistor. | 
7.2 = beans Ratio and Delay 







arameters from the 
x with the correct 
tl is asserted. Refer to 
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srom_oe_] 
srom_clk_h 















first instruction fetch to miss the Icache and read instructi 
memory. 


If srom_present_] is asserted during setup, then thi 
SROM load as follows: : 


1. 
2. 


The format of the Icache data 
Chapter 12. 


7.5 Serial Terminal Po : 


: # Icache, the three SROM load signals 
‘an drive a diagnostic terminal using an interface 











red if the SROM load is bypassed. The initial Istream reference after 
Slacation 0. Because that is a cacheable-space reference, the Scache will 


‘he data cache (Deache) is disabled by reset. It is not initialized or flushed by 
-reset. It should be initialized by PALcode before being enabled. 


the external board-level Beache is disabled by reset. It should be initialized by 
*ALcode before being enabled. 
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7.6.1 Icache Initialization 


The Icache is not kept coherent with memory. When it is necessary to make 
it coherent with memory, the following procedure is used. The CALLPAL I 
function performs this function using this procedure. 



















1. Execute an MB instruction. This forces all writes in the w: 
memory. 


— Stall until write buffer is drained 
— Carry load or issue a HW_MFPR from any Mbox 

2. Write to IC_FLUSH_CTL with an HW_MTPR to flush ti 
Execute a total of 44 NOP instructions (BIS33: 


7.6.2 Flushing Dirty Blocks 
During a power failure recovery, dirty blo 
and backup cache (Beache), if pre : 
Systems Without a Bcache 
To flush out dirty blocks from: 












Besthe. When powering up, this field 
a 1M-byte Bceache. During system 


for this function in sys 
is initialized to as 






operation. 
To flush out the 











IZE><2:0> = 0x1; do loads at a stride of 64 bytes 
ontinuous memory; guarantees all dirty blocks from 


S et BC_CONFIG<BC_SIZE><2:0> = 0x4; do loads at a stride of 64 bytes 
_through 64K bytes of continuous memory; guarantees all dirty blocks from 
“set2 are flushed out. 


All ot ér values of BC_CONFIG<BC_SIZE><2:0> are undefined in this mode. 
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Systems with a Bcache 


To flush out dirty blocks from the Scache and Bcache on power fai 
following sequence must be used to guarantee that all the dirty blocks:h 
been written back to main memory: 


















perform loads at a stride of Bcache block size = 2x size 


7.7 External Interface Initialization 


After reset, the cache control and bus interface unié. (Ch 
configuration dictated by the reset state of the IPR’bi that ‘select the 
configuration options. The Cbox response ta: fem command 3 and vee 
_ generated memory accesses is determined: 
System environments that are not comp; 
must use the SROM Icache load {éature | 
program. This program configures thé:ext 
as needed. 
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Table 7-2 internal Processor Register Reset State 
IPR Reset State Comments 























Ibox Regi sters 


ITB_TAG UNDEFINED 
ITB_PTE UNDEFINED 
ITB_ASN UNDEFINED 
ITB_PTE_TEMP UNDEFINED 
ITB_IAP UNDEFINED 
ITB_IA UNDEFINED 
ITB_IS UNDEFINED 
IFAULT_VA_FORM UNDEFINED ;s. 
IVPTBR UNDEFINED 
ICPERR_STAT UNDEFINED 
IC_FLUSH_CTL 


EXC_ADDR 

EXC_SUM “PALcode must clear exception 
summary and exception register 
write mask by writing EXC_SUM. 

EXC_MASK 

PAL_BASE Cleared on reset. 

PS PALcode must set processor status. 

ICSR All bits are cleared on reset except 


ICSR<37>, which is set, and 
ICSR<38>, which is UNDEFINED. 


PALcode must initialize. 





3NDEFINED PALcode must initialize. 
UNDEFINED PALcode must initialize. 


UNDEFINED PALcode must initialize. 
UNDEFINED PALcode must initialize. 
UNDEFINED 


(continued on next page) 
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Table 7-2 (Cont.) Internal Processor Register Reset State 
IPR Reset State Comments 


SL_XMIT Cleared Appears on exterti ‘pi 


SL_RCV | UNDEFINED 

PMCTR | See Comments 

Mbox Registers 

DTB_ASN UNDEFINED 

DTB_CM UNDEFINED 

DTB_TAG Cleared Sbits < 
but not‘on:timeout reset. 

DTB_PTE 

DTB_PTE_TEMP ce 

MM_STAT Must be unlocked by PALcode by 

eading VA register. 

VA Must be unlocked by PALcode by 
reading VA register. 

VA_FORM | Must be unlocked by PALcode by 
reading VA register. 

MVPTBR _ . PALcode must initialize. 

DC_PERR_STA : PALcode must initialize. 

DTBIAP 

DTBIA 


DTBIS 


Cleared on chip reset but not on 
timeout reset. 


Cleared Cleared on chip reset but not on 


timeout reset. 
Cleared Cleared on chip reset. 
; MAF MODE<05> cleared on 
timeout reset. 
UNDEFINED PALcode must write this register to 


clear Deache valid bits. 


(continued on next page) 
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Table 7—2 (Cont.) Internal Processor Register Reset State 



















IPR Reset State Comments 

ALT MODE UNDEFINED | 
cc UNDEFINED CC is disabled on 
CC_CTL UNDEFINED : 
DC_TEST CTL UNDEFINED 


DC_TEST_TAG UNDEFINED 
DC_TEST_TAG_TEMP UNDEFINED 













Cbhox Registers 
SC_CTL See Comments | 
SC_STAT UNDEFINED* 
SC_ADDR UNDEFINED 
BC_CONTROL See Comments:::: _ 
BC. CRONTROL 06: 04> end <15> 
at on reset but not timeout reset. 
‘All other bits are UNDEFINED and 
must be initialized by PALcode. 
BC_CONFIG At power-up, BC_CONFIG 
is initialized to a value of 
0000 0000 0001 744116. 
BC_TAG ADDR 
EL STAT PALcode must read twice to unlock. 


EI_LADDR 
FILL_SYN 


Note 


Beachie. Yiarameters BC_SIZE (size), BC_RD_SPD (read speed), 

2 WR_ SPD (write speed), and BC_WE_CTL (write-enable control) are 
1 configured to default values on reset and must be initialized in the 
BC_CONFIG register before enabling the Bcache. 









7-12 Preliminary Edition—September 1994 


7.9 Timeout Reset 


The instruction fetch/decode unit and branch unit (Ibox) contains a tif 
that times out when a very long period of time passes with nosinstruc ¢ 
completing. When this timeout occurs, an internal reset eves 
clears sufficient internal state to allow the CPU to begin exé 
Registers, IPRs (except as noted in Table 7—2), and caches: 











is deasserted. Continuous trst_l assertion d 
guarantee that the IEEE 1149.1 test port d 
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without privileged architecture ream )EG 
tags are parity protected. The inst 
(Ibox) implements logic that d¢ 


very long time and forces a 


executed. 





EXC_ADDR contains either the PC of the instruction that caused the 
parity error or that of an earlier trapping instruction. 


CPERR_STAT<TPE> or <DPE> is set. 
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¢ Can be retried. 


Note 


The Icache is not flushed by hardware in this event. If an Ica¢ 
error occurs early in the PALcode routine at the machine 
point, an infinite loop may result. = 


¢ Recommendation: Flush the Icache early in the MCHK 
8.1.2 Scache Data Parity Error—istream 


* Machine check occurs before the instructiozi 
executed. 


SC_STAT: SC_ DPERR<T: : : 
multiple errors. : 


(Bit 4 eicaten Wi 
either octaword. ) 


If the Ist 


uesting a block of data with the same Bcache index, but a different 
“tag. This may not be feasible on tag parity errors, because the tag address 
_is suspect. If the requested block is loaded with no problems, then the 
“bad data” has been replaced. If the “bad data” is marked dirty, then when 
ienew data tries to replace the old data, another parity error may result 
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during the write-back (this is a reason not to attempt this in PAL code, 
because a MCHK from PALcode is always fatal). 













8.1.3 Scache Tag Parity Error—istream 


¢ Machine check occurs before the instruction causing the arity err 
executed. 


soulioles errors. 
e SC_STAT: CBOX_CMD is IRD 


¢ SC_ADDR: Contains the addréss:6f '2-byte block containing the error. 
(Bit 4 indicates which oct: essed first, but the error may be in 
either octaword.) 


SC STAT: CBOX_CMD is DRD, DWRITE, or READ_DIRTY. 


SC_ADDR: Contains the address of the 32-byte block containing the error. 
(Bit 4 indicates which octaword was accessed first, but the error may be in 
ither octaword.) 
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8.1.5 Scache Tag Parity Error—Dstream or System Commands 


8.1.6 Deache Data Parity Error 












Machine check occurs. Machine state may have changed. 


Cannot be retried. Probably will not be able to recover by deleting. 
process because the exact address is unknown. 


rnultiple ¢ errors. 


SC_STAT: CBOX_CMD ‘3 DRD, DWRITE, READ 
or INVAL. 


SC_ADDR: records physical address bits <39:04> of error. 
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8.1.7 Dcache Tag Parity Error 











¢ Machine check occurs. Machine state may have changed. 
¢ DCPERR_STAT: <TP0> or <TP1> is set. <LOCK> is set.;.. 
there are multiple errors. 
Note 


For multiple parity errors in the same cycle, th 
but more than one error bit will be set. 


error. 


MM _STAT locked. Contents @ptain 


EL ST. 4 <UNC _ECC_ERR> is set; <SEO_HRD_ERR> is set if there are 
multiple errors. 


EI_STAT: <EJ_ES> is set if source of fill data is memory/system; clear if 
Bcache. 


I_STAT: <FIL_IRD> is set. © 


Preliminary Edition-September 1994 8-5 


¢ EI_ADDR: Contains the physical address bits <39:04> of the octawaz 
associated with the error. . 


e FILL_SYN: Contains syndrome bits associated with the failing. 
This register contains byte parity error status if in parity mod 


¢ BC_TAG_ADDR: Holds results of external cache tag prob 
was enabled for this transaction. 









Note 


arty, ape when the new data 
arity error may result during 
mpt ‘this in PALcode, because a 


‘FILL_SYN: Contains syndrome bits associated with the failing octaword. 
is register contains byte parity error status if in parity mode. 
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8.1.10 


Bcache Tag Parity Errors—istream 





was s enabled for this transaction. 














Machine check occurs before the instruction causing th 


EL STAT: <EI_ES> is clear. * 
EL STAT: <FIL_IRD> is set. 
EI_ADDR: Contains the phy: 


tk ing fo the case in which a peters dirty block 
it victim being properly written back and the case 


© retried, but may only need to delete the process if data is 
confined to a single process and no second error occurred. Beache hit 

is determined based on the tag alone, not the parity bit. The victim is 
processed according to the status bits in the tag, ignoring the control 
field parity. PALcode can distinguish fatal from nonfatal occurrences by 
hecking for the case in which a potentially dirty block is replaced without 
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the victim being properly written back and the case of false hit whet the 
tag parity is incorrect. 


¢ EI_STAT: <BC_TPERR> or <BC_TC_PERR> is set; <SEO_HRD,_ERR> 3 
set if there are multiple errors. : 


e EILSTAT: <EI_ES> is clear. 
e EI_STAT: <FIL_IRD> is clear. 


¢ EI_ADDR: Contains the physical address bits <39: 
associated with the error. 











EI STAT: <EI_PAR_ERR> is set; 
multiple errors. 


EL STAT: <EIL_ES> is set. 





The:21164 does not check the ECC on outgoing Bcache data. If it is bad, the 
.. ¥eeiving processor will detect it. 
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8.1.14 Istream or Dstream Correctable ECC Error (Bcache or Memory) 





e The 21164 hardware corrects the data before filling the Scache an 
The Deache is completely invalidated. The data in the ee 









routine. (Using LDxL, STxC. If the STxC fails, the locat: 
to be scrubbed.) : 


¢ A separately maskable correctable error interrupt: 
as machine check). (Masked by clearing ICSR<@GRD 


e ISR: <CRD> is set. | 
EI_STAT: peecipaer is set. 





EL ADDR: Contains the shyerall 
associated with the error. 







memory # access timing), it is ae that the internal Ibox timeout will 
eventually detect a stall if a fill fails to occur. To properly terminate a 
fill in an error case, the fill_error_h pin is asserted for one cycle and the 
normal fill sequence involving the fill_h, fill_id_h, and dack_h pins is 
generated by the system environment. 
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e A fill_error_h assertion forces a PALcode trap to the MCHK entry point, 
but has no other effect. : 
Note 


No internal status is saved to show that this happened. If , 
systems must save this status, and include read operation: 
appropriate status registers in the MCHK PALcode. 


8.1.16 System Machine Check 


e The 21164 has a maskable machine check i 
system environments to signal fatal errors: 
to a read access from the 21164. It i is masked at IP 
21164 is in PALmode. 3 


e ISR: <MCK> is set. 
8.1.17 Ibox Timeout 


tly connected 
d.and anytime the 








e When the Ibox detects a tim 
oo point. 


ultaneously, a partial internal reset occurs: most states except IPR 
tate is reset. 


CPERR_STAT: <TMR> is set. 
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¢ This can be used to restore 21164 and the external environmentito a 
consistent state after the external environment detects a commaiid, or 
address parity error. 
























Note: 


There is no internal status saved to differentiate the 
case from the timeout reset case. If necessary, systems 
status, and include read operations of the appro] 
in the MCHK PALcode. 


8.2 MCHK dices 


source of a machine check. 


¢ Must flush Icache to remove bad dé 
buffer may be flushed by executing 
buffer with new data (32 in 


e Read EXC_ADDR. 
e If EXC ~ADDR=PAL 


{épendencies or MB to ensure read operation of SC_ADDR 
es before subsequent read operalicn of SC_STAT. 


Read SG {STAT (unlocks SC_ADDR). 


Read EI_ADDR, BC_TAG_ADDR, and FILL_SYN. 


Use register dependencies or MB to ensure read operations of E]_ADDR, 
BC_TAG_ADDR, and FILL_SYN finish before subsequent read operation of 
:.WI_ STAT. 


¢ Read EISTAT and save (unlocks ELADDR, BC_TAG_ADDR, FILL_SYN). 
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¢ Read EISTAT again to be sure it is unlocked, discard result. 
















then skip retry: 

— EI_STAT<TPERR> 

— EISTAT<TC_PERR> 

— EISTAT<EI_PAR_ERR> 

—- EJ_STAT<SEO_HRD_ERR> 
—- EI_STAT<UNC_ECC_ERR> and not EI_STATSEIL_IRD 
— DCPERR_STAT<LOCK> : 
— SC_STAT<SC_SCND_ERR> 

- SC_STAT<SC_TPERR> te. 


— Wot (SC_STAT<CMD> = IRD) an&SC_S 
— ICPERR_STAT<TMR> : 
— ISR<MCK> 


¢ If none of the previous con, 
can be retried or the sou 


*+<SC_DPERR> 


tions are trué; then there is either an IRD that 


The case can be 


* VA, SC_STAT, and EI_STAT are already unlocked. 
Check for arithmetic exceptions: 
.. Read EXC_SUM. 
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— Check for arithmetic errors and handle according to operating ‘system- 
specific requirements. : 


—- Clear EXC_SUM (unlocks EXC_MASK). 


¢ Report the processor-uncorrectable MCHK according to | 
specific requirements. 
















8.3 Processor-Correctable Error Interrupt 


¢ Read EIADDR and FILL_SYN. 
¢ Use register dependencies or MB to ¢ 


as CHIE . VA will not be locked because DIB_MISS and 
all not ever be interrupted. 


AF once in the CRD flow, and then only if ISR<CRD> 
j ctable error were to occur just after a second read 
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8.4 MCK_INTERRUPT Flow 


e Arrived here through interrupt routine because ISR<MCK> bit set. 


¢ Report the system-uncorrectable MCHK according to operatii 
specific requirements. 







8.5 System-Correctable Error Interrupt Flow (| 


The system-correctable error interrupt is system specif 
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e Electrical characteristics 
¢ dec characteristics 
¢ ac characteristics 


¢ Power supply considerations 


9.1 Electrical Characteristi 


mum Ratings 

Ratings 

~55°C to 125°C (-67°F to 257°F) 
15°C to 85°C (59°F to 185°F) 
Vss —0.5 V, Vdd 3.6 V 
3.3Vt05.5 V 


TBD W typical 
TBD W maximum 


Caution 


== Stress b yond the absolute maximum rating can cause permanent 
' damage to the 21164. Exposure to absolute maximum rating conditions — 
for extended periods of time can affect the 21164 reliability. 
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9.2 dc Characteristics 


The 21164 is designed to run in a CMOS/TTL environment. The 21164 i 
tested and characterized in a CMOS environment. 






















9.2.1 Power Supply 


The Vss pins are connected to 0.0 V, and the Vdd pins are cori 
+5%. 


9.2.2 Input Signal ii 


ose_clk_in_h,|) 
9.2.3 Output Signal Ping 


rail-to-rail, timing is specified to aniaat ss 


Bidirectional pins are either input or 


Table 9-2 CMOS DC Charac 





Parameter Requirements 
Symbol Description Max. Units Test Conditions 
TTL Inputs/Outputs 
= V = 
0.8 V — 
_ V Ton = -8.0 mA 
0.4 V Igt = 12.0 mA 
Power/Leakage 


k input leakage —50 50 pA 0.5 V <Vin <5.5V 
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always enabled, even when a pin is in the high-impedance state. This: 
that some current will flow from the 21164 Gf the pin has a pat 

































Signal Name Notes 

tms_h Has pull-up bleeder 
tdi_h Has pull-up bleed 
osc_clk_in_h 50 2 to Vos en 
ose_clk_in I os 
temp_sense 150 2 to V, 


The temp_sense pin must be : 


9.3 ac Snerectenstce 


esigners have a choice of two system clocking schemes to run the 
yichronous to the system: 





The 21164 generates and drives out a system clock, sys_elk_out1_h,l. It 
runs synchronous to the internal clock at a selected ratio of the internal 
clock frequency. There is a small clock skew between the internal clock and 
sys_clk_out1_h,lL. 
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2. The 21164 synchronizes to a system clock, ref_clk_in_h, supplied hy. 
the system. The ref_clk_in_h clock runs at a selected ratio of the 2: 













clock by an on-chip digital phase-locked loop (DPLL). 


Refer to Section 4.2 for more information on clock functions. 


9.3.2 Input Clocks 
The differential input clocks ose_clk_in_h,1 provide the:ti 


Module Circuitry 


osc_clk_in_h 





50 Q 
To 
Differential 
50 Q ifi 
Oscillator sat ea 
50 Q 
Capacitors 47 to 220 pF MLO-012929 


ock outputs follow the internal ring oscillator when the 21164 is running 
scillator, just as they would when an external clock is applied. The 
frequency of the ring oscillator varies from chip to chip within a range of 
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10 MHz to 100 MHz. This corresponds to an internal CPU clock frequency 
range of 5 MHz to 50 MHz. When signal de_ok_h is deasserted, thé system 
clock divisor is forced to 8, and the sys_clk_out2 delay is forced to 3. = 









A special on-chip circuit monitors the osc_clk_in pins and di tects 






across long traces. The clock input pins app 
resistor connected to a high srs vo 





source must be between 0. 
source may be used. Full- 


tor Side level. When connected this way, oscillators 
0 Vss can be used provided they can drive a wae 


Preliminary Edition—-September 1994 9-5 


Table 9-3 shows the input clock specification. 




















Table 9-3 Input Clock Specification 


Signal Parameter Nominal! Bin' 





osc_clk_in_h,] symmetry 
osc_clk_in_h,l minimum voltage 0.6 
osc_clk_in_h,] Z input 50 


1Minimum clock frequency = 10.0 bt (if lower, then “ing oscillato?: 
Maximum clock frequency = TBD MH 


9.3.3 Signal Characteristics 


osc_clk_in_h,] signals (see Table 9-3: 
All output signals are TTL compatible. 


9.3.4 Backup Cache Loop Timing 
The 21164 can be configured to_s1 


Table 9-4 Bcache Lo 





Signal Value Name 

data_h<127:05 Lins  Tdsu 
data_h<127:0>: 0.0 ns Tdh 
25 Tdd + 0.4 ns! Tiod 

Output hold time Tmdd Tioh 

Output delay Tdd + Teycle + 0.4 ns! Tdod 

Output hold Tmdd + Teycle Tdoh 


jue 0.4 ns accounts for on-chip driver delay and clock skew. 
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Outgoing Bcache index and data signals are driven off the internal clock edge 
and the incoming Beache tag and data signals are latched on the same: 
clock edge. Table 9-5 shows the output driver characteristics. 











Table 9-5 Output Driver Characteristics 
Specification 40 pF Load 10 pF Loa 


Maximum driver delay 2.6 ns 


Minimum driver delay 1.0 ns 






neue 9—2 shows the Bca 





c and Write timing. 
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Figure 9-2 Bcache Timing 























Beache Loop (Read) 


CPU Clock 


Index Out 


Data In Bee 


Beache Cycle 


Bcache Loop (Write) 


CPU Clock 


Index Out 


Data Out 


LJ-03409-TIO 


clock sys_clk_out1_h,l output timing. All 
swith the rising edge of the internal CPU clock. 
. times to be ae are of the 


tied toM a ee } 
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Table 9-6 Alpha 21164 System Clock Output Timing (sysclk=T,) 

















Signal Specification Value 
sys_clk outl Output delay Tdd 
sys_clk_out1_h,]l Minimum output delay Tmdd 
data_bus_req_h, Input setup 1.1 ns 


data_h<127:0>, 
addr_h<39:4> 


data_bus_req_h, Input hold 0 ns 
data_h<127:0>, 
addr_h<39:4> 





addr_h<39:4> Output delay Taod 
addr_h<39:4> Output hold time Taoh 
data_h<127:0> Output delay ee Tdod2 
data_h<127:0> Output hold tite Tdoh? 
















addr_bus_req_h Tabrsu 
addr_bus_req_h Tabrh 
dack_h Tntacksu 
cack_h Tntcacksu 
cack, dack Tntackh 


Tu 





rbo Mode* 





addr_bus_req_] 
cack_h, dack: 


l.l ns Ttacksu 





Ons Ttackh 


gure 9-3 shows sys_clk system timing. 


Preliminary Edition-September 1994 9-9 


Figure 9-3 sys clk System Timing 








Relationship of CPU Clock and sys_clk_out1 










Tsysd 
CPU Clock 


sys_clk_outt 


Memory Read (Turbo Made) 










Tsysd 


sys_cik_out1 — 


Taod 


CPU Clock 






Address/ = 
Command Out # 


CPU Clock 









Address/ yn 


Command Out sui: 


Tntackh 
LJ-03410-TIO 
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9.3.4.2 Reference Clocks 
Systems that generate their own system clock expect the 21164 to syfthronize 

























phase- locked loop (DPLL) to synchronize its sys_ clk out! 
system clock that is applied to the ref_clk_in_h signal. 





Signal Sacicwion . 

data_bus_req_h, Input setup : Tdsu 
data_h<127:0>, s 

addr_h<39:4> 

data_bus_req_h, Tsdadh 


data_h<127:0>, 
addr_h<39:4> 


addr_h<39:4> Tdd + 0.5 x Teycle+0.9ns! Traod 


addr_h<39:4> Tmdd Traoh 
data_h<127:0> Tdd + 1.5 x Teycle+0.9ns' Trdod? 
data_h<127:0> Tmdd + Teycle Trdoh? 


Non-Turbo Mode 


addr_ bus req | 3.8 ns Tntrabrsu 
: 0.5 x Teycle Tntrabrh 
Input setup 3.3 ns Tntracksu 
Input setup 3.7 ns Tntrcacksu 


‘The value 0.9 fis accounts for on- -chip skews that include 0.4 ns for driver delay and clock skew, 
id phase detector skews due to circuit delay (0.2 ns) and delay in ref_clk_in_h due to the 
“package (0.3 ns). 


2For all write transactions initiated by the 21164, data is driven one CPU cycle later. 


(continued on next page) 
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Table 9-7 (Cont.) Alpha 21164 Reference Clock Input Timing 

Signal Specification Value 
Non-Turbo Mode 

- (0.5 x Teycle) 











cack_h, dack_h 








Input hold 








Turbo Mode * 


Input setup 1.1 ns x racksu 





addr_bus_req_h, 
cack_h, dack_h 


addr_bus_req_h, Input hold 0.5 x Teycle Ttrackh 
cack_h, dack_h 






3In turbo mode, control signals are piped on chip for one sys 
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9.3.4.3 Digital Phase Locked Loop 


Figure 9-4 and Table 9-8 describe the digital sisi inckea loop (WEE ) stages 
of operation. 




















Figure 9-4 ref_cik System Timing 


Relationship of CPU Clock and ref_clk_in 


CPU Clock 


ref_clk_ 


Relationship of CPU 


CPU Clock 


ref_clk_in 


sys_clk_out1 


Tsysd 
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Table 9-8 ref_clk System Timing Stages 
Stage Description 


















eo The internal CPU clock rising edge coincides with the risin Bae. 
ref_clk_in_h. 


The DPLL causes the internal CPU clock to stretch for.gné phase (1 cycle” 


of osc_clk_in_h,]). 


2) 

3] The stretch causes ref_clk_in_h to lead the inte! 
phase. 

1] 


The CPU clock is eee ~sectia faster than t 


systems, because the 21164 uses sys_ Oe 
timing during system transactions. 


9.3.4.4 Timing—Additional Signals 
This section lists timing for all o 


perf_mon_h 
irq _h 
mceh_hlt_irg_h 


clk_mode_h 
sys_mch_chk_irg |] 
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Table 9-9 Input Timing for sys_cik_out- or ref_cik_in-Based Systems 

















Value 
Signal Specification sys _clk_out ref_clk_in 
cfail_h, fill_h, fill_error_h, fill_id_h, Input setup 1.1 ns 1.1 ns 


fill_nocheck_h, idle_bc_h, shared_h, 
system_lock_flag_h 


Testability pins: 
port_mode_h, srom_data_h, 
srom_present_l 


efail_h, fill_h, fill_error_h, fill_id_h, Input hold 
fill_nocheck_h, idle_be_h, shared_h, 
system_lock_ flag h 


Tsdadh 


Testability pins: . 
port_mode_h, srom_data_h, 
srom_present_l 


Clocking System Name 






Signal Specification sy: ‘ sys_clk_out  ref_clk_in 





















Unidirectional Signals 


Taa +0.5*T ycje+0.9 ns 





addr_res_h, 
int4_valid_h,} 
scache_set_h, 
srom_clk_h, 
srom_oe_]l, 
victim_pending_h 


addr_res_h, 

int4_valid_h,! 
scache_set_h, 
srom_clix_h, 
srom_oe_l, 





Taod Trad 


Tmaa Teoh Trach 


‘Qutput delay TaatT cycte +0.4 ns Taat1.5*T yce+0.9 ns T aoa Trdod 


put hold TmaatT cycle Tmaa +T eycte Taoh Traoh 






(continued on next page) 
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Table 9-10 (Cont.) Output Timing for sys_clk_out- or ref_cik_in-Based System 
Clocking System Value Clocking System N 






































Signal Specification sys _clk_out ref_clk_in 
Bidirectional Signals 


Input mode: 


addr_cmd_par h, Input setup 1.1 ns 1.1 ns 
emd_h, 

data_check_h,! 
tag_ctl -par_h,* 


tag_ _h, 
tag_shared_h® 


addr_cmd_par_h, Input hold 0 ns 
emd_h, 
data_check_h,! 

tag_ctl_par_h,? 

tag_dirty_h,® 

tag_shared_ "h? 


Output mode: 


addr_cmd_par_h, Outputdelay TT gg+0.4 ns 
emd_h, 
tag_ctl_par | h,* 
tag_dirty_h,* 
tag_shared_| 'h, 4 
tag_valid_h* 


data_check_h? Output delay 


addr_cmd_par_h, Output 
cmd_h, 

tag ctl _par_h 
tag_dirty_h4 
tag_shared_h,* 
tag_valid_h‘ 


data_check_h? 


Taat 15 *Teyctet9-9ns Taog Tyrdod 


Dnaa Taoh Trach 


cycle Tnaa +Teycle Toh Trdoh 





1Read transaction 
2Write transaction 
3Pills fror 


ils in Table 9-11 are used to control Bcache data transfers. These signals 
re driven off the CPU clock. The choice of sys_clk_out or ref_clk_in has no 
‘pact on the timing of these signals. 
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Table 9-11 Bcache Control Signal Timing 


Signal Specification Value 
Input mode: 

tag_data_h, iuiedatind par_h, Input setup 1.1 ns 
tag_valid_ h ) . 
tag_data_h, tag_data_par_h, Input hold 
tag_valid_h 

Output mode: 

data_ram_oe_h, data_ram_ we_ h,! Output delay 
tag_ram_oe_h, tag_ram_we_h! 

tag_data_h, tag_data_par_h, Output dela: 
tag_valid_h 


data_ram_oe_h, data_ram_we_h,} 
tag_ram_oe_h, tag_ram_we_h 
tag_data_h, tag_data_par_h, 
tag_valid_h 


Output hak: 


frequency is divided 
circuitry. 


a frequency that the chip manufacturing tester is 
‘Givide-by-1 mode has been designed into the clock 


ot asserted, the clock frequency that is applied to the 
ek, signals osc -clk_ in_h,l bypasses the clock divider and is sent to 


peed with a one-half frequency (up to 294 Mhz) ose_clk_in_h,l. 
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9.3.5.3 Module Test Mode 


When the clk_mode_h<0> signal is not asserted and clk_mode_h<1>* 
is asserted, the clock frequency that is applied to the input clock neon 
osc_clk_in_h,] is divided by 4 and is sent to the chip clock driv 
phase-locked loop (DPLL) continues to keep the on-chip SyS_c 




















9.3.5.4 Clock Test Reset Mode 


When both the clk_mode_h<0> and the clk_mode_| 
the sys_clk_out generator circuit is forced to reset t 
allows the chip manufacturing tester to synchroni 
cycle. Table 9—12 lists the test modes. 


Table 9-12 Test Modes ae, 
* Mode 















clk_mode_h<0> 
Normal 
Chip test 

Module test 
Clock reset 


input levels of 0.8 
point of Vdd/2. 








iy 
If the line is driven in ‘the electrical center, the load could be as low as 


. Therefore, a characteristic impedance range of 30 2? to 90 22 could be 
erienced. 
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impedance, ranging from 30 2 to 90 2. 





To prevent signal quality problems due to overshoot or rin 


Specification 
TBD 


16.6 MHz 
4ns 
4ns 


Maximum propagat se ti tdo_h Ugierenced to 14ns 


tck_h falling edge) 
= 20 ns 


the 21164 pins, and not just at the PCB edge. 


Plus 5 V is not used in the 21164. The voltage difference between the Vdd pins 
it Vss pins must never be greater than 3.6 V. If the differential exceeds this 
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9.4.1 Decoupling 


frequency capacitors placed close to the chip package’s power 
with very short module etch will give best results. ae 


the 21164’s V/O circuits, the I/O pin voltages must 
ile Vdd supply is at least 3 V or greater. 


se ee but it is required for the 21164. 


hree-terminal voltage regulator can be used to make 3.3-V Vdd from the 5-V 
supply, provided the output of the regulator (Vdd) tracks the 5-V supply with 
- small offset. The requirement is that when the 5-V supply reaches 4 V, 
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Vdd must be 3 V or higher. While the 5-V supply is below 4 V, Vdd ¢ 


be less 
than 3 V. 











All 5-V sources on the 21164’s I/O pins should be disabled if the power upply 
sequencing is such that the 5-V supply will exceed 4 V before: E 

least 3 V. The 5-V sources should remain disabled until the 
is equal to or greater than 3 V. 





Disabling all 5-V sources can be very difficult becaus 






of current, and will put a voltage across a 21164 Ve 
the (no higher than 4 V until there is 3 V) rule. 


terminations. 
Disabling the non-21164 5-V out 





— supply supervisor circuit produces 
Paven with 5-V outputs into the high impedance 
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Thermal Manageme 















This chapter describes the 21164 thermal manageftie 
considerations. = 


10.1 Thermal Specifications 


thermal resistance. 


Operating Temperature 


(Tj — Te) 


5a = O;hs + 0,84 
T; =Ta+ P+ +0;a 


ie symbols in the previous equations are defined as follows: 


9;a is the junction-to-ambient thermal resistance (°C/W). 
0;hs is the junction-to-heat-sink thermal resistance (°C/W). 
. 0,8a is the heat-sink-to-ambient thermal resistance (°C/W). 
T; is the maximum junction temperature (°C). 
T, is the ambient temperature (°C). 
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Tc is the heat-sink temperature at a predefined location (°C). 
P is the power dissipation (W). 






















Table 10—1 lists the values for the center of heat-sink-to-ambient @ 
499-pin grid array. Table 10—2 shows the allowable T, (without ex¢ 
at various airflows. 

Note 


Digital recommends using the heat sink because i 
the ambient temperature requirement. 


Table 10-1 ¢,a at Various Airflows 


Airflow 100 





200 2 


(ft/min) 

§.a with heat sink #1 2.30 1.30 0.45 0.41 
(°C/W) 

6.a with heat sink #2 1.25 0.35 0.32 
(°C/W) 


Frequency: 266 MHz 





Table 10-2 Maximum Tz at Various A 
Airflow 
(ft/min) 


T, with heat sink #1 
(°C) 

T, with heat sink #2 
CC) | 
Frequency: 266 MH# 
Power: 45 W 


a= 400 600 300 1000 
50.5 582 618 63.6 


60.4 64.0 66.3 67.6 
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10.2 Heat Sink Specifications 


Two heat sinks are specified. Heat sink type #1 mounting holes are in: 
the ai fins. Heat sink type #2 Mounting holes are rotated: 










along with their approximate dimensions. 


Figure 10-1 Type #1 Heat Sink 


| ‘ 6.57 cm 
(2.588 in.) 


ii 


6.57 cm 
(2.585 in.) 





3.25 cm 
(1.280 in.) 


MLO-012428 
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Figure 10-2 Type #2 Heat Sink 







3.80 cm 
(1.495 in.) 


:.printed:circuit board (PCB) component placement: 
=:RCB with the heat sink fins aligned with the 


ieee of the heat sink. 
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This chapter describes the Alpha 21164 microg 
including chip package physical specificati 


sink dimensions, refer to Chapter 10. 
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Figure 11-1 Package Dimensions 
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26.67 mm 
(1.050 in) 


57.40 mm (2.260 in) 


28.70 mm 
(1.130 in) Typ 






25.40 mm 
(1.000 in) Typ 






38.10 mm 
(1.500 in) Typ 
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1.27 mm (0.050 in) Typ 
4.32 mm (0.170 in) Typ 


2.54 mm (0.100 in) Typ 
Standoff (4x) 


499x 1.40 mm (0.055 in) Typ 
1.27 mm (0.050 in) Typ 


(0.300 in} Typ 


-130 in) Typ 


Capacitors (12x) 
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11.2 Signal Descriptions and Pin Assignment 
This section provides detailed information about the 21164 pinout. Thi 21164 


has 499 pins aligned in an interstitial IPGA design. 
11.2.1 Signal Pin Lists 


spare (unused) signal pins, 104 power (Vdd) pins, 


for a total of 499 pins in the array. 


Table 11-1 Alphabetic Signal Pin List | 


Signal 


addr_bus_req_h 


addr_h<5> 
addr_h<8> 
addr_h<11> 
addr_h<14> 
addr_h<17> 
addr_h<20> 


addr_h<23> 


addr_h<26> 
addr_h<29> 
addr_h<32> 
addr_h<35> 
addr_h<38> 


addr_res_h<1> & 


efail_h 


PGA — 
Location § Signal 


addr_emd_par k 
addr h<6> 
addr_h<9> 


_mode h<0> 

cemd_h<1> 
epu_clk_ out_h 

' data_check_h<0> 
data_check_h<3> 


addr_h<16> 
addr_h<19> 
C05 addr_h<22> 
addr_h<25> 
AW35 addr_h<28> 
BC35 addr_h<31> 
BA33 addr_h<34> 


— AV30 addr_h<37> 


BB30 addr_res_h<0> 


E27 cack_h 
AUa2l1 ells mode _h<1l> 
Aig emd_h<2> 

-BA25 dack_h . 
J41 data_check_h<1> 
G43 data_check_h<4> 


(continued on next page) 


Preliminary Edition-September 1994 11-3 


































PGA 
Location 


BB14 
AV14 
BA11 
BCO09 
AW09 
AV08 
BC39 
BA37 
AV34 
AW33 
BC33 
BA31 


. C27 


G21 
BA23 
C19 
B24 
K38 
G41 


Table 11-1 (Cont.) Alphabetic Signal Pin List 


Signal 





data_check_h<5> 
data_check_h<8> 
data_check_h<11> 
data_check_h<14> 


data_h<1> 

data_h<4>. 
data_h<7> 

data_h<10> 
data_h<13> 
data_h<16> 
data_h<19> 
data_h<22> 
data_h<25> 
data_h<28> 
data_h<31> 
- data_h<34> 
data_h<37> 
data_h<40> 
| data_h<43> 
data_h<46> 
data_h<49> 
data_h<52> 
data_h<55> 
data_h<58> 







PGA 


H38 
J03 

G01 
G05 
L39 
1A3 
N41 
R39 


U41 
W4i1 
Y¥42 
AA43 


Location Signal 


PGA 
Location Signal 





data_check_ h<6> G39 
data_check h<9> _K06 
data_check_h<12> 
data_check_h<15> 
data_h<2> 

data_h<5> 

data_h<8> 

data_h<1l> 
data_h<14> 
data_h<17> 
data_h<20> 
data_h<23> 
data_h<26> 


data_check_h<7> 
data_check_h< b: 


data_h<33> AE41 
data_h<36> AG41 
data_h<39> AJ43 
data_h<42> AJ39 
data_h<45> AL41 
data_h<48> AN43 
data_h<51> — AN39 
data_h<54> AP38 
data_h<57> AU41 
data_h<60> AW43 
data_h<62> data_h<63> AW39 
data_h<65> data_h<66> M06 
data_h<68> data_h<69> N05 
data_h<71> N03 data_h<72> NO1 


(continued on next page) 
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Table 11-1 (Cont.) Alphabetic Signal Pin List 
PGA 






PGA 
Location Signal 











Signal Location Signal 
data_h<73> Po2 data_h<74> 
data_h<76> R03 data_h<77> 
data_h<79> V06 data_h<80> 
data_h<82> W05 data_h<83> 
data_h<85> YO6 data_h<86> 
data_h<88> AAO3 data_h<89> 
data_h<91> ACOl _— data_h<92> 
data_h<94> ADO2 data_h<95> 
data_h<97> AE03 data_h<98> 
data_h<100> AG03 data_h<101> 
data_h<103> AJO1 data_h<104> 
data_h<106> AJO5 

data_h<109> ALO03 

data_h<112> ANO1 

data_h<115> 

data_h<118> 

data_h<121> 

data_h<124> 

data_h<127> 

dc_ok_h 

fill_id_h 


index_h<4> 
index_h<7> 
index_h<10>- “index_h<11> 

F i index_h<14> 
index_h<17> 
index_h<20> 


index _h<23> 



























R05 data_h<75> : 
R01 3 
U03 
Wo03 
YO2 AA05 
AAOI ABO6 
AC05 
AEO1 
AGO1 
data_h<lr0e2> AG05 
fita_h<105> AH06 
data_h<108> ALO1 
data_h<111> AL05 
data_h<114> AMO06 
data_h<117> ARO3 
data_h<120> AUOl1 
data_h<123> AU05 
data_h<126> AV06 
data_ram_we_h A23 
fill_h G23 
idle_bc_h A27 
index_h<6> F28 
index_h<9> A31 
F30 index_h<12> E31 
C33 index_h<15> F32 
A35 index_h<18> C35 
B35 index_h<21> A37 
F36 index_h<24> E37 


(continued on next page) 
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Table 11-1 (Cont.) Alphabetic Signal Pin List 





































PGA PGA 
Signal — Location Signal Location Signal 
index_h<25> A39 int4_valid_h<0> F38 int4_valid_h<1> 
int4_valid_h<2> F06 int4_valid_h<3> E03 irq h<0> 
irg_h<1> AU27 irg_h<2> 
meh_hlit_irg_ h AU25 osc_elk in h 
perf_mon_h AW29 port_mode_h<0>__ 
pwrfailirgh | AV26 _ refclkinh 
scache_set_h<1> Al7 shared_h 
srom_data_h BC19 srom_oe_l 
system_lock flag h G27 sys_clk outl_h 
sys_clk out2_h AV24 sys_clk_ out2_l 
sys_reset_l BC27 
tag_data_h<21> E07 
tag_data_h<24> AOT 
tag_data_h<27> Cog 
tag_data_h<30> F12 tag_data_h<32> All 
tag_data_h<33> E13 tag_data_h<35> C13 


tag_data_h<36> 
tag_data_par_h 


tag_data_h<38> E15 
tag_ram_oe_h C21 
tag_valid_h F16 
tdo_h BA17 
test_status_h<1> AV16 


tag_ram_we_h 
tek_h 
temp_sense 
tms_h 


spare_in<438> 


victim_pending_h E21 
spare_io<250> 


(continued on next page) 
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Table 11-1 (Cont.) Alphabetic Signal Pin List 








Signal 


Vss—Metal planes 
2! and 5? 


Vdd 
Metal planes 4 and 6 


1Metal plane 2—Seal ring connection tied to Vss 
2Metal plane 5—Heat slug braze pad connections t: 






PGA Location 


A03, A41, AAO7, AA37, ACO7, AC37, ADO4, AD40, AFO2, AF42,"A 
AH40, ALO7, AL37, AM04, AM40, APO2, AP42, ARO7, AR37,404, 















G13, G17, G31, G35, H04, 40, ‘J07, J37, Ko2, K42,2 
U07, U37, V02, V42, Y04, Y40 


AB02, AB04, AB40, AB42, AE07, AE37, AF04, 






AU15, AUL9, AU29, AU33, AU37, AV04 "AV40 ‘ 
AY26, AY30, AY34, AY38, AY42, B04, BO. $ "B 

= _BB3e: BB36, BB40, BC23, C03, 
C05, C39, C41, DO02, D6, D10, Du, 8 30, D34, D38, D42, F04, F40, 
Gil, G15, G19, G29, G33, Gar, H02;HAz; Fis 
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11.2.2 Pin Assignment 
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Figure 11-2 shows the 21164 pinout from the top view with pins facing déven. 
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Figure 11-2 Alpha 21164 Top View (Pin Down) 
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BA 
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Figure 11-3 shows the 21164 pinout from the bottom view with ping#facing 
up. 









Figure 11-3 Alpha 21164 Bottom View (Pin Up) 
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Testability and Diagnostics 


The 21164 has several internal testability fea 
factory use only. These features are beyonég: 















Pin Name 
port_mode_h<I> 
port_mode_h<0 = 


srom_present:..] 


srom_data_h/Rx * 
srom_clk_h/Tx 


Receives SROM or serial terminal data. 


Supplies clock to SROMs or transmits serial 
terminal data. 


SROM enable. 

IEEE 1149.1 TDI port. 

IEEE 1149.1 TDO port. 

IEEE 1149.1 TMS port. 

IEEE 1149.1 TCK port. 

IEEE 1149.1 optional TRST port. 
Indicates Icache BiSt status. 





srom_oe_] 


st_status_h<0> 
test_status_h<1> 


Oo oO -— #2 4 6 = OF 


Outputs an IPR-written value and timeout reset. 
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12.2 Test Interface . 
The 21164 test interface supports a serial ROM interface, a serial Basie : 











available and set to normal test interface mode when port. mode... 
Driving these pins to a value of anything other than 00 redefine 
test interface pins and invokes special factory test modes not* : 
document. 


12.2.1 SROM Port 


Signal pins srom_present_l, srom_data_h, srom_ 
constitute the SROM interface. 





















Bs 
, then the:21164’s reset 
s before executing its first 


sequence, susie the first instru 
instructions from the off-chip me 


During the SROM load: 


¢ Signal srom oct supp! 


enough Alpha AXP code to complete the configuration 
fnal interface (for example, setting the timing on the external cache 
fagnosing the path between the CPU chip and the other ROMs). 


: usion of the Icache self-test. This gives the code loaded into the Icache 
ess to all of the visible state within the chip. 


o Section 12.3 for details of the Icache fill operation from SROMs. 
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12.2.2 Serial Terminal Port 


Once the data in the SROM has been loaded into the Icache, the th ¢ 
SROM port pins turn into a simple serial I/O pins that can be used t 2 3 
diagnostic terminal through an interface such as RS422. : 
















When the SROM is not being read, the srom_oe_l output, 
serial diagnostic terminal port is enabled if this pin is wired 


12.2.3 IEEE 1149.1 Test Access Port 
Pins tdi_h, tdo_ h, tek_h, tms_ hi sand 5 


access to factory manufacturing. 
port is ee with most re 









? The optiénal trst_l pin has an internal pull-down, instead of a pull-up 
as required by IEEE 1149.1 (non-complied spec 3.6.1(b) in IEEE 1149.1— 
1993). The trst_l pull-down allows the chip to automatically force reset 
to the IEBE 1149.1 circuits in a system in which the IEEE 1149.1 port is 
unconnected. This may be considered a feature for most system designs 
that use IEEE 1149.1 circuits solely during module manufacturing. 
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2. Coverage of oscillator differential input pins 











The two differential clock input pins, osc_ elk _ in_ h and osc_clk_ in_ 


in the boundary scan register in n bit position 33 (at pin dc_ 
captures the output of a “clock sniffer” circuit. It capture 
oscillator is connected, and captures a “0” if the chip’s oscilla 

are broken. 


This exception to the standard is made to permit 2 
oscillator input pins. 





Figure 12-1 IEEE 1149.1 Test Access Pé6rt. 






TRST_L —s 
TAP Controller : 
TSH] 3 : CONTROL 


TCK_H] 4 


TDO_LH <€_| 


TOoLH [| > 


e Dieé=ID Register (IDR) — | 


as Boundary Scan Register (BSR) 


LJ-03463-TI0 


Controller 


Th AP controller contains a state machine. It interprets IEEE 1149.1 
protocols received on signal tms_h and generates appropriate clocks and 
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control signals for the testability features under its jurisdiction. Thé.state 
machine is shown in Figure 12-2 






Figure 12-2 TAP Controller State Machine 







K Test Logic 
Reset 
[i Run_—Test/Idle 






“Scan Sequence 





Scan Sequence” 
MK-1455-08 


nstructions for public and private factory use. Table 12-3 summarizes the 
“public instructions and their functions. 
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value 00001. This automatic load feature is useful for testing the inte 
the IEEE 1149.1 scan chain on the module. 


Table 12-3 Instruction Register 


Scan Register 
IR<4:0> Name Selected 





Operation 


00000 EXTEST BSR 
00010 ‘SAMPLE/ BSR 
PRELOAD 

00010 Private BSR 
00011 Private BSR 
00100 CLAMP BPR 
00101 HIGHZ BPR 
00110 Private IDR 
00111 Private IDR 
01000 Private BP] 
through = 
11110 


12.2.4 


During Icache BiSt Operation 


test_status_h<<0> is forced high at the start of the Icache BiSt. If the 
euéhe BiSt passes, the pin is deasserted at the end of the BiSt operation, 
otherwise it remains high. 
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e IPR read and write operations to test status pins 











PALcode can write to the test_status_h<1> signal pin and can Te 
test_status_h<0> signal pin through hardware IPR access. Refer 
Chapter 6. 


¢ Timeout Reset 


the 





In either of these conditions, the CPU signals the 
outputting a 256 CPU cycle wide pulse o 
pulse on test_status_h<1> pin is clock 
as an pierre 256 cry cycle R j 


therefore appears 
s and falls on system 
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srom data h serial input -> 












BHT Array Oo=> 28> via ee 2 A> 
Data 127 -> 95 -> 126 -> 94 ->... => 
Predecodes 19 +> 14-> 18 => 13 >>... 
Data parity 1-> O-> 
Predecodes 9-> 4-> BSP - 3. Stes 
Data 63.->- 31--> 62:=> 30> 22. 
Tag Parity b -> 
Tag Valids 0-> 1-> 
TAG Phy.Address b -> 
TAG ASN 0 -> LS gy Se 
TAG ASM b -> 

3-> 14-> ... => 


TAGs 1 
- b = Single bit signal 





12.4 Boundary Scan Register 


long. Table 12-4 provides 
E poonesicd between the 


e ‘control cell tristates the output 
e group are configured as input pins. 
iftetias groups gr_1 through gr_7 in the 


drivers and all bidirectional. pit ii 
The bidirectional Pin: groups 


tdo_h pin prior to signal<a>. 
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Table 12-4 Boundary Scan Register Organization 


Pin BSR BSR Control 
Type Count Cell Type Group 






Signal Name 

























TR_ADL Control 0 io_bcell 
addr_h<21:4> B 1:18 io_bcell 
temp_sense_h O — None 
test_status_h<1:0> O 19:20 io_bcell 
trst_l B — None 
tek_h B — None 
tms_h | B — None 
tdo_h O 
tdi_h B 
srom_oe_l O 
srom_clk_h O 
srom_data_h I 
srom_present_] B 
port_mode_h<0:1> I Compliance enable pins. 
clk_mode_h<0> I — 
ose_clk in_h,] I Analog pins. 
clk_mode_h<1> I: in_bcell — —_ 
sys_clk_ outl_h,J Oo io_bcell — = 
sys_clk_out2_h,] io_bcell — —_— 
cpu_clk out_h none — For chip test. 
ref_clk_in_h in_bcell — — 
sys_reset_l 32 in_bcell _ —_ 
dc_ok_h — in_bcell — Compliance enable pin. 
Osc_Sniffer_h_ 33 in_bcell — Captures 1 if osc is 
connected, otherwise 
captures 0. 
I 34 in_bcell ~— — 
I 35 in_bcell — — 
I 36 in_bcell — — 


(continued on next page) 
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Table 12—4 (Cont.) Boundary Scan Register Organization 



































Pin BSR BSR Control 
Signal Name Type Count Cell Type Group 
irg_h<3:0> | | 37:40 in_beell  — 
SPARE_I0<250> B 41 iobeell  — 
perf_mon_h I 42 in_bcell 
TR_ADR Control 43 io_bcell 
addr_h<39:22> : B 44:61 io_bcell 
TR_DDR | Control 62 io bell 
data_h<63:0> B—s63:126 ~—sio_beelll 
data_check_h<0:7> B 127:134  io_bcell 
int4_valid_h<1:0> O do_bee 
SPARE_I0<438> — SS Lower right corner, 
unpopulated. 
index_h<25:4> O — 
addr_res_h<2:0> oO — 
idle_be_h I ss 
system_lock_ flag h I — 
data_bus_req_h I. — 
efail_h I — 
fill_ nocheck_h — 
fill_error_h in_beell — — 
fill_id_h in_bcell — os 
fill_h in_bcell — — 
dack_h in_bcell — — 
- addr_bus_req h inbeell  — ee 
in_bcell — — 
in_bcell — — 
io_bcell — — 
io_bcell — — 
io_bcell — — 


(continued on next page) 
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Table 12—4 (Cont.) Boundary Scan Register Organization 


Signal Name 


tag ram_oe_h 
victim, _pending h 
TMIS1 
addr_cmd_par_h 
emd_h<0:3> 
scache_set_h<1:0> 
TTAG1 
tag_ct]_par_h 
tag_dirty_h 
tag_shared_h | 
TTAG2 
tag_data_par_h 
tag_valid_h 
tag_data_h<38:20> 
SPARE_I0<002> 


int4_valid_h<2:3> 
TR_DDL 
data_check_h<15:8> 
data_h<64:127> 


Pin 
Type 

O 

O 
Control 
B 

B 

O 
Control 


BSR 
Count 
177 

178 

179 

180 
181:184 
185:186 
187 
188 
189 
190 
191 

























BSR Control 


Cell Type Group Remarks 


io_bcell _ 
io_bcell — 
io_beell 
io_bcell 
io_bcell 
io_bcell 
io_bcell. 


Lower left corner, 
unpopulated. 


io_bcell gr_7 — 
io_bcell gr_7 —_ 
io_bcell 
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12.5.1 Icache BiSt Operation Timing 


The Icache BiSt is invoked by deasserting the external reset signal 
sys_reset_l. Figure 12—3 shows the timing between various events, 
BiSt operations. 










Figure 12-3 BiSt Timing Event-Time Line 


Deassert BiSt Start 
sys_reset_! (test_status_h<1:0>=01) 





MK-1455-09 


the end of the time fie sh 
time line shown in Figur 


22644+2% 22645 
19721+2% 19722 
13291+14% 13292 
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12.5.2 


Table 12-6 BiSt Timing for Some System Clock Ratios, Port Mode=Normal 


















(CPU Cycles) 
Syscik CPU Cycles 
Ratio ty te tg 
3 24 679344 67935 
4 28 788864 78888 
15 105 199379% 199380 


Automatic SROM Load Timing 


the SROM interface port. 
Timing events are shown in Fi 
Table 12-8. 


Figure 12-4 SROM Load. 


BiSt Done 


Deassert 
Last Rise Interna! Reset Deassert 
srom_cik_h (T%Z_RESET BL) srom_oe | 


MK-1455-10 
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Table 12-7 SROM Load Timing for Some System Clock Ratios (Systefft. 


Cycles) 
Syscik § System Cycles’ 
Ratio ty to ts te ts 
3 4 22 4408090  4408216+44 4408217 
4 3 48 3306099 3306193+214 3306194 


15 3 13 881627 881651+9'2 88165 





1Measured in sysclk cycles, where +n refers to an additional n CP 





CPU Cycles) 








Sysclk CPU Cycles 
Ratio ty to 

3 12 66 
4 12 192 
15 45 195 


































EERIE ERASERS 
ey X 





















teu" 4 x sysclk period + 1.1 ns 102,400 Bits Total 
tho =Ons MK-1455-07 


the required setup time) = [126 — (5 « sysclk ratio)] *« (CPU cycle time). 
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_ Alpha AXP Instruction Set 

















A.1 Alpha AXP Instruction summary — 


This appendix contains a summary of all Alpiia:AXP 
All values are in hexadecimal radix. Tabl eri 
Format and Opcode columns that are in. 


Instruction 
Format 


Branch oo is the 6-bit opcode field. 
Floating- 00 is the 6-bit opcode field. 
point fff is the 11-bit function code field. 
Memory oo is the 6-bit opcode field. 
Memory/ oo is the 6-bit opcode field. 


function code — ffff is the 16-bit function code in the 


displacement field. 


oo is the 6-bit opcode field. 
h is the high-order 2 bits of the 


| Memory/ 
- branch 


displacement field. 
Operate ooff oo is the 6-bit opcode field. 
ff is the 7-bit function code field. 
00 oo is the 6-bit opcode field; the 


particular PALcode instruction is 
specified in the 26-bit function code 
field. 
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IEEE and VAX floating-point instructions are shown in Tables A-5 and 
respectively. 


Table A-2 Architecture Instructions 


Mnemonic Format Opcode Description * 
ADDF F-P 15.080 

ADDG F-P 15.0A0 

ADDL Opr 10.00 

ADDL/V Opr 10.40 


ADDQ Opr 
ADDQ/V Opr 
ADDS F-P 
ADDT : F-P 
AND Opr 
BEQ Bra 
BGE Bra 
BGT Bra 


BIC Opr 
BIS Opr 
BLBC Bra : Branch if low bit clear 
Branch if low bit set 
Branch if < zero 
Branch if < zero 
Branch if 4 zero 
Unconditional branch 
.Branch to subroutine 
Trap to PALcode 
CMOVE if = zero 
CMOVE if > zero 
CMOVE if > zero 
CMOVE if low bit clear 
CMOVE if low bit set 
CMOVE if < zero 
CMOVE if < zero 
CMOVE if # zero 


Compare byte 
Compare signed quadword equal 
Compare G_floating equal 


(continued on next page) 
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Table A-2 (Cont.) Architecture Instructions 


Mnemonic Format Opcode 
CMPGLE F-P 15.0A7 
CMPGLT F-P 15.0A6 
CMPLE Opr 10.6D 
CMPLT Opr 10.4D 
CMPTEQ F-P 16.0A5 
CMPTLE F-P 16.0A7 
CMPTLT F-P 16.0A6 
CMPTUN F-P 16.0A4 
CMPULE 

CMPULT 

CPYS 

CPYSE 















Description 





“floating unordered 
re unsigned quadword 


Copy sign and exponent 


Copy sign negate 

Convert D_floating to G_floating 
Convert G_floating to D_floating 
Convert G_floating to F_floating 
Convert G_floating to quadword 
Convert longword to quadword 
Convert quadword to F_floating 
Convert quadword to G_floating 
Convert quadword to longword 
Convert quadword to longword 
Convert quadword to longword 
Convert quadword to S_floating 
Convert quadword to T_floating 
Convert S_floating to T_floating 
Convert T_floating to quadword 


Convert T_floating to S_ floating 
Divide F_floating 
Divide G_floating 
Divide S_floating 
Divide T_floating 


(continued on next page) 
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Table A-2 (Cont.) Architecture Instructions 


Mnemonic 


EQV 
EXCB 
EXTBL 
EXTLH 
EXTLL 
EXTQH 
EXTQL 
EXTWH 
EXTWL 
FBEQ 


FBGE 


FCMOVEQ 
FCMOVGE 
FCMOVGT 
FCMOVLE 
FCMOVLT 
FCMOVNE 
FETCH 
FETCH_M 
INSBL 
INSLH 


INSLL 
INSQH 
INSQL 
INSWH 





Format 


Opr 
Mfc 
Opr 
Opr 
Opr 
Opr 
Opr 
Opr 
Opr 
Bra 


Bra 
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Opcode 


11.48 
18.0400 
12.06 
12.6A 
12.26 
12.7A 
12.36 
12.5A 
12.16 
31 
































Description 


‘CMOVE if > zero 
FCMOVE if > zero 
FCMOVE if < zero 
FCMOVE if < zero 
FCMOVE if # zero 
Prefetch data 

Prefetch data, modify intent 
Insert byte low 

Insert longword high 





Insert longword low 
Insert quadword high 
Insert quadword low 
Insert word high 
Insert word low 
Jump 

Jump to subroutine 
Jump to subroutine return 
Load address 

Load address high 
Load F floating 

Load G_floating 


(continued on next page) 


Table A-2 (Cont.) Architecture Instructions 



















-Mnemonic Format Opcode Description 

LDL Mem 28 

LDL_L Mem 2A 

LDQ Mem 29 

LDQ_L Mem 2B 

LDQ U Mem 0B 

LDS Mem 22 

LDT Mem 23 

MB : Mfc 18.4000 

MF_FPCR F-P 

MSKBL Opr 

MSKLH Opr a 

MSKLL Opr Mask longword low 
MSKQH Opr Mask quadword high 
MSKQL Mask quadword low 
MSKWH Mask word high 


MSKWL 
MT_FPCR 


: Mask word low 
Move to FPCR 
Multiply F_floating 





Multiply G_floating 
Multiply longword 


Multiply quadword 


_ Multiply S_floating 
Multiply T_floating 
Logical sum with complement 
Read and clear 
Return from subroutine 
Read process cycle counter 
Read and set 
Scaled add longword by 4 
Scaled add quadword by 4 
Scaled subtract longword by 4 


Scaled subtract quadword by 4 
Scaled add longword by 8 


(continued on next page) 
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Table A-2 (Cont.) Architecture Instructions 


Mnemonic 


S8ADDQ 
S8SUBL 


Format 


Opr 
Opr 
Opr 
Opr 
Opr 
Opr — 
Mem 
Mem | 
Mem 
Mem 
Mem 
Mem 
Mem 


Mem 
Mem 
F-P 
F-P 
Opr 
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Opcode 


10.32 
10.1B 
10.3B 
12.39 
12.3C 

























Description 


Scaled add quadw 
Scaled subtract 


jabtract G_floating 


Subtract longword 


Subtract quadword 


Subtract S_floating 

Subtract T_floating 

Trap barrier 

Unsigned multiply quadword 
high 

Write memory barrier 
Logical difference 


‘Zero bytes 


Zero bytes not 


A.1.2 Opcodes Reserved for PALcode 


























Table A-~3 Opcodes Reserved for Digital 
Mnemonic Opcode Mnemonic Opcode Mnemonic 
OPC01 01 OPC05 05 ~  OPCOB 
OPCO2 02 OPC06 06 OPCOC 
OPC03 03 OPC07 07 OPCOD 
OPC04 04 OPCOA 0A 


Table A—4 lists the 21164-specific instructions 
Section 6.6. 





21164 

Mnemonic Opcode 

HW_LD 1B : Performs Dstream loads. 

HW_ST 1F Performs Dstream stores. 

HW_REI 1E Returns instruction flow to the 
program counter (PC) pointed 
to by EXC_ADDR internal 
processor register (IPR). 

HW_MFPR Accesses the Ibox, Mbox, and 


Deache IPRs. 


Accesses the Ibox, Mbox, and 
Deache IPRs. 


poi: instructions, with and without qualifiers. The opcode for 
tructions is 164. 
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Table A-5 IEEE Floating-Point Instruction Function Codes 























None /C IM /D /U /uc /UM 
ADDS 080 000 040 0CO 180 100 140 
ADDT O0AO 020 060 0EO 1A0 120 : 
CMPTEQ 0A5 
CMPTLT OA6 
CMPTLE 0A7 
CMPTUN 0A4 
CVTQS OBC 03C 07C 
CVTQT OBE 03E O7E 
CVTTS OAC 02C 06C 
DIVS 083 003 043 
DIVT 0A3 023 063 
MULS 082 002 042 
MULT 0A2 022 062 
SUBS 081 001 041. 
SUBT 0A1 021 061 
/SU 


(continued on next page) 
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Table A-5 (Cont.) IEEE Floating-Point Instruction Function Codes 











None /C N NC ISV ISvVCISVI 
CVTTQ OAF 02F 1AF 12F 5AF  52F 


D ND SVD «6/SVID) | /M VM 
5EF 7EF O6F 





the Ffbit function code field for the 
».for these instructions is 154g. 











ADDF 


CVTDG 41E 59K 51E 


ADDG 420 5A0 520 
CMPGEQ 4A5 
CMPGLT 4A6 
CMPGLE 4A7 
CVTGF 12C 4AC 42C 5AC 52C 


12D 4AD 42D 5AD 52D 


103 483 403 583 503 
123 §=4A3 423 5A3 523 
102 482 402 582 502 
122 4A2 422, 5A2 522 
101 481 401 581 501 


(continued on next page) 
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Table A-6 (Cont.) VAX Floating-Point Instruction Function Codes 
None /C lu uc OS ISC «ISU 
SUBG 0OA1 021 1A1 121 4A1 421 ‘BAL . 









SUC 


None /C N NC IS isc. IS¥ 
OAF 02F 1AF 12F 4AF 












A.4 Opcode Summary 


- Table A~7 lists all Alpha AXP opcodes from 00 
In the table, the column headings that appe 
granularity of 83g. The rows beneath. the O 









replace that 8 with the pun 
column. 





For age the third row ae 
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Table A-7 Opcode Summary 





























Offset 00 08 10 18 20 28 
0/8 PAL* LDA INTA* MISC* 
(pal) (mem) (op) (mem) 
1/9 Res LDAH INTL* \PALN\ 
(mem) (op) 
ZA Res Res INTS* JSR* 
(op) (mem) 
3/B Res LDQ _U INTM* \PALN 
(mem) (op) 
4/C Res Res Res Res 


5/D Res Res FLTV* 
(op) 


6/E Res Res FLTI* 


eaning 

TEEE fonting:poliik instructio 
FLTL* Floating-point operate instruct i 
FLTV* VAX floatin point instruction 
INTA* Integer arithmetic:ir 
INTL* Integer logical indtPaction ope 
INTM* Integer att oy instrugti 
INTS* Integer shift 3 cti 


\PAL\N Reserved for PALco é 
Res Reserved for Digital 


‘in Table A-8 are required for all Alpha AXP 
fentations. The notation used is oo.ffff, where oo is the hexadecimal 
‘ode.and ffff is the hexadecimal 26-bit function code. 
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Table A-8 Required PALcode Function Codes 


Mnemonic Type Function Code 
DRAINA Privileged 00.0002 
HALT Privileged 00.0000 
IMB Unprivileged 00.0086 


A.6 Alpha 21164 Microprocessor IEEE Floatifig- 
Conformance 


The 21164 supports the IEEE fipatingpoit operations as eg by the 


Architecture Reference Manual. 


The following information is s ee 
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Divide-by-zero (DZE) 








Floating overflow (OVF) 


The floating overflow trap is always enab 
destination register is eed EO 









: x. The 21164 never produces a denormal 
instead. 
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e Inexact (INE) 


The inexact trap can be disabled. The destination register always co! 
the properly rounded result, whether the trap is enabled. The excep 









Harpeeire results, this requires both an infinite precisi : 
fraction. For aie results, this requires an infinite pred 


with integer overflow. All valid yee 
overflow and underflow. 








Integer overflow (IOV) 


The integer overflow trap isabled.*“fhe destination register always . 


<32>) of the true result (not the 


ting to quadword integer or longword 
if the rounded result is outside the 


exceptions concurrent with the assertion of the SWC 
it sets EXC_SUMISWC]. 








pes Occur, then only the input exception is recorded in the FPCR and only 
€ input exception is signaled to the Ibox. 
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Alpha 21164 Micro} rocessor 


Table B—1 lists specifications for the 21164. 
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Table B—1 Alpha 21164 Microprocessor Specifications 





















Feature Description 

Cycle time range 4.4 ns to 3.2 ns. 

Process technology 0.5 micron CMOS. 

Die size 664 X 732 mils. 

Package 499-pin IPGA (interstitial pin grid array). 
Number of signal pins 291. 
Maximum power 45 W @ 3.75 ns cycle time (266 MHz), Vdde3. 
dissipation (typ) 


Clocking input Hz ata 
3.5-ns “oul time). 

Virtual address size 43 bits. 

Physical address size 40 bits. 

Page size 8K byte. 

Issue rate 4 instructions per cycle 

Integer instruction 7 stage. : 

pipeline 

Floating instruction 

pipeline 

On-chip Dcache 


On-chip Icache 
On-chip Scache 


On-chip data translation 
buffer 

On-chip instruction | 
translation buffer 


fully Associative, not-last-used replacement, 8K pages, 128 
(Xx ASN=127), full granularity hint support. 


fly associative, not-last-used replacement, 128 ASNs 
ASN=127), full granularity hint support. 


On-chip FPU supports both IEEE and Digital floating point. 
Separate 128-bit data and address bus. 
Allows microprocessor to access a serial ROM. 







Floating-point uri 








ion scales linearly with frequency over the frequency range 225 MHz to 312 MHz. 
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Table C-1 lists the revision history for this docume! 


Table C-1 Document Revision History 
Date 


July 20, 1994 
September 12, 1994 


Revision 
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Technical ar ise ore Resa 


This appendix describes how to: 


¢ Obtain id peraicondnerar oe : 


Order Number 
21-40658—0x 


21A01—xx 
tpha 21164 Wieaipacenser Evaluation Board Design EB164—xx 
ackage . 
Heat Sink Assembly Type 1 XXXXK—XX 
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Product : Order Number 











Heat Sink Assembly Type 2 | XXXXX—XX 


D.3 Ordering Digital Semiconductor ey Kits 


order the following products. 






Product 


Alpha 21164—xxx Sample Kit 
Alpha 21164—xxx Sample Kit 
‘Alpha 21164—xxx Sample Kit 










Order Number 


EY-L520E—-DP-YCH 
EC-QAENA-TE 
EC—QAEPA-TE 
EC—QAEQA-TE 
EC—QAExx—TE 
ewlett-Packard Logic Analyzer EC-X2454—72 
“purchase the Alpha Architecture Reference Manual, call 1-800-DIGITAL from 


jada, or contact your local Digital office, or technical or reference bookstore where 
0 re distributed by Prentice Hall. 
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D.5 Ordering Associated Third-Party Literature 
You can order the following third-party literature directly from the v 


Title 
PCI System Design Guide 










Vendor 


PCI Special Interest Group: 
M/S HF3-15A 
5200 N.E. Elam Young:Pkwy 
Hillsboro, Oregon 974: 
1-503-696-2000 2" 
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abort 

The unit stops the operation it is perform i 
some other operation. i. 

ABT 

Advanced bipolar/CMOS tech 


address space number (ASN: 


An optionally agers Tregist sed to reduce the need for invalidation 








Application-specific integrated circuit. 


Glossary—1 


ASN 
See address space number. 
























assert 
To cause a signal to change to its logical true state. 


AST 
See asynchronous system trap. 


asynchronous system trap (AST) 


A software-simulated interrupt to a user-defined routi 
process to be notified eee with respett: 


backmap 
A memory unit that is used to not 


bandwidth 
Bandwidth is often used to: 


Bcache 


See external cache. 


barrier trans 


A transactio 
instruction: 


al interface as a result of an MB (memory barrier) 





Bipolar/CMOS. The combination of bipolar and MOSFET transistors in a 
mmon integrated circuit. 


Glossary—2 






bidirectional 


Flowing in two directions. The buses are bidirectional; they carry bo 
- and output signals. 






BiSr 

Built-in self-repair. 
BiSt 

Built-in self-test. 
bit 
Binary digit. The smallest unit of data in a 
as Oor 1. . 


bifiar stem, designated 






BU 
Bus interface unit. See Cbox. 


block exchange 


Memory feature that improves:bus b 


idwidthby paralleling a cache victim 
write-back with a cache miss:fil. 









board-level cache 


See external cache: 


boot 


Short for bootst ditigsan operating system into memory is called 
booting. 


BSR 





jgcheck 


software condition, usually the response to software’s detection of an 
“internal inconsistency,” which results in the execution of the system bugcheck 
de. 
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bus 


A group of signals that consists of many transmission lines or wires. It® 
interconnects computer system components to provide communications P 
for addresses, data, and control information. 


byte 
Eight contiguous bits starting on an addressable byte boundary. 
numbered right to left, 0 through 7. 



















byte granularity 


‘Memory systems are said to have byte granularity if adjacent bytes can be 
written concurrently and independently by different.proces: processors. 





cache 
See cache memory. 


cache biock 


The smallest unit of storage that 
Also known as a cache line. 


cache coherence 
Maintaining cache coneren’ 
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cache interference 


The result of an operation that adversely affects the mechanisms an 
procedures used to keep frequently used items in a cache. Such interf ance 
may cause frequently used items to be removed from a cachg 
significant overhead operations to ensure correct results. Ei 
hampers performance. 



















cache line 
See cache block. 


cache line buffer 
A buffer used to store a block of cache memo: 


cache memory 


speed. It contains copies of data recerit f 
several bytes of data from memory 
ié € Alpha 21164 microprocessor 


contains three on-chip inter lso write-through cache and 


write-back cache. 


cache miss 


fe unit of the computer that is responsible for interpreting and executing 
instructions. 
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-cisc 


Complex instruction set computer. An instruction set consisting of a larg 
number of complex instructions that are managed by microcode. Contras 
RISC. 















clean 


In the cache of a system bus node, refers to a cache line that is Vak 
not been written. 


clock 
A signal used to synchronize the circuits in a comput 


CMOS 


conditonal branch instructions 
Instructions that test a register for po 


e bus used to carry data between the 21164 and external devices. Also 
ad the pin bus. 
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Dcache 


Data cache. A cache reserved for storage of data. The Deache does nét: 
instructions. 





















DIP 
Dual inline package. 


direct-mapping cache 
A cache organization in which only one address co 
any data in the cache, because any block of main 
in only one possible position in the cache. 
direct memory access (DMA) 

Access to memory by an I/O device that 
dirty 
One status item for a cache block. T 
written so that it may differ fron#th 
dirty victim | 
Used in reference to a cack 


evaluation board. A hardware/software applications development platform 
for the Alpha AXP program and a debug platform for the Alpha 21164 
icroprocessor. 
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Ebox 
The Ebox contains the 64-bit integer execution data path. 


ECC 


Error correction code. Code and algorithms used by logic to facil 
detection and correction. See also ECC error. 


ECC error 
An error detected by ECC logic, to indicate that data (6 


(hard error). | | = 
ECL 
Emitter-coupled logic. 


EEPROM 


Electrically erasable programmable read-o emory;*A memory device that 
can be byte-erased, written to, and Eoutrast with FEPROM. 


EPLD 
Erasable programmable logic ; 


“g£.the microprocessor chip, usually located 
‘alled boayd-level or module-level cache. 













Fbox 


calculations. | 
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floating point 






A number system in which the position of the radix point is indicate the 
exponent part and another part represents the significant digits or fraétional 
part. 














flush 
See cache flush. 


FPGA 
Field-programmable gate array. 


FPLA 
Field-programmable logic array. 


granularity 


A characteristic of storage systems tlt. defin 
can be read and/or written with a sin 
independently. VAX systems oe é 





; he amount of data that 
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_fcache 
Instruction cache. A cache reserved for storage of instructions. One of tl 


blocks, or lines, contain 32 bytes of instruction stream data with 
as well asa 6-bit ASM field and an 8-bit branch history field ; ger 


is unaffected by the invalidate bus. 


IEEE Standard 754 


A set of formats and operations that apply to floating-péi 
formats cover 32-, 64-, and 80-bit operand sizes. ....... 


IEEE Standard 1149.1 
shitecture used in 


JFET 
Junction fiel 






latency 


feedback shift register. 
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load/store architecture 




















~ instruction set. 


longword 


Four contiguous bytes starting on an arbitrary byt 
numbered from right to left, 0 through 31. 


LSB 
Least significant bit. 
machine check 


in system hardware-detected 


errors that can be fatal to ten ‘ope tion. triggéred, machine check 


handler software analyzes the error. 


MAF 
Miss address file. 


main memory 


sof the processor unit performs address translation, interfaces to 
e Deache, and performs several other functions. 


MBZ 


e must be zero. 
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MESI protocol 


A cache consistency protocol with full support for multiprocessing. The MESI 
protocol consists of four states that define whether a block is modified (M)3=: 
exclusive (E), shared (S), or invalid (I). 


MIPS 
Millions of instructions per second. 
miss 
See cache miss. 
~ module | 
A board on which logic devices (such as transigto1 

















module-level cache 
See external cache. 


MOS 


Metal-oxide-semiconductor. 


MOSFET | 
Metal-oxide-semicoridiictor 


MSI 
Medium-scale integrati 


at:replicates the sequential computer and interconnects 
; eacli processor can execute the same or a different 


Must be zero ( BZ) 


field that is reserved and must be supplied as zero. If examined, it must be 
umed to be UNDEFINED. 








NAF SALLY ALIGNED 
See ALIGNED. 
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NATURALLY ALIGNED data 


Data stored in memory such that the address of the data is evenly di¥ 
the size of the data in bytes. For example, an ALIGNED longword i is $ med 
such that the address of the longword is evenly divisible by 425508 





















NMOS 
N-type metal-oxide-semiconductor. 


NVRAM 


Nonvolatile random-access memory. 


OBL | 
Observability linear feedback shift register 


otewow 


machines. 


operand 
The data or r. 


ary. See PALcode. Also Programmable array 
at can be programmed by a process that blows 


special environment for running PALcode routines. 


" parameter 


variable that is given a specific value that is passed to a program before 
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parity 
A method for checking the accuracy of data by calculating the sum of 
number of ones in a piece of binary data. Even parity requires the orrect 


sum to be an even number, odd parity requires the correct sum t 
number. 























PGA 
Pin grid array. 


pipeline 


A CPU design technique whereby multiple instruction: are 
overlapped in execution. : 


PLA | 
Programmable logic array. 


PLCC 
Plastic leadless chip carrier or plasti 


PLD 
Programmable logic device. 


PLL 
Phase-locked looy 


PMOS 
P-type metal-oxid 


PQFP 
Plastic quad 


primary. che 







at portion of the CPU that contains the virtual address of the next 
truction to be executed. Most current CPUs implement the program counter 

?€as a register. This register may be visible to the programmer through the 
instruction set. 
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PROM 
Programmable read-only memory. 

















pull-down resistor ; 
A resistor placed between a signal line and a negative voltag 


pull-up resistor 
A resistor placed between a signal line to a positive 


quad issue 


Four instructions are issued, in parallel, during the'sar 
The instructions use different resources and:$6'% ¢ 


quadword 


Eight contiguous bytes starting 6: 
numbered from right to left, 0 ihren ak z 


RAM 


Random-access memory. 


READ_BLOCK 


conditions. 
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reset 


An action that causes a logic unit to interrupt the task it is performing a 
to its’ initialized state. 





















t. ZO 


RISC 


a small, hardeare-aacisted instruction set, thus elimiriz’ 
microcode. 


ROM 
Read-only memory. 


RTL 
Register-transfer logic. 


SAM 


Serial access memory. 


SBO 
Should be one. 


SBZ 
Should be zero. 


Scache 
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set-associative 




















A form of cache daar in which the location of a data block in 


one possible cache location, and fully associative organizati 
from anywhere in main memory can be put anywhere i in the 
set-associative” cache allows data from a given address: 
cached in any of n locations. The Scache in the 2 
three-way set-associative organization. 
SIMM 


Single inline memory module. 
SIP 

Single inline package. 
SIPP 

Single inline pin package. 


SMD 
Surface mount device. 


area of memory set aside for temporary data storage or for procedure and 
interrupt service linkages. A stack uses the last-in/first-out concept. As items 
re added to (pushed on) the stack, the stack pointer decrements. As items are 
Sti ieved from (popped off) the stack, the stack pointer increments. 
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STRAM 
Self-timed random-access memory. 


superpipelined 


Describes a pipelined machine that has a larger number of pi 
more complex scheduling and control. See also pipeline. 


superscalar 
Describes a machine architecture that allows multipl 
to be issued in parallel during a given clock cycle. 
tag 


The part of a cache block that holds the addreg¢info 
if a memory operation is a hit or a miss on th: 


d to determine 


TB 
Translation buffer. 


tristate 


Refers to a bused line that has.t igh, low, and high-impedance. 


TTL 


e ration that may halt the processor or cause it to lose information. Only 
iviiéged software (that is, software running in kernel mode) can trigger an 
EFINED operation. 
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UNPREDICTABLE 















occurrences. 


UVPROM 
Ultraviolet (erasable) programmable read-only mem 


valid 
Allocated. Valid cache blocks have been loaded with d ay return 
cache hits when accessed. 
victim 


Used in reference to a cache bloeisin theé AC 
cache block is valid but is about to fe 
conflict. 


e of a system bus node. The 


virtual cache 
A cache that is addressed 
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write-back 


A cache management technique in which write operation data is written'3h 
cache but is not written into main memory in the same operation. This mg 
result in temporary differences between cache data and main memiory¢d, 

Some logic unit must maintain coherency between cache and mai memory. 










write-back cache 


Copies are kept of any data in the region; read and writs 
the copies, and write operations use additional state x 
there are other copies to invalidate or update. 


write-through 


A cache management technique in which a wri S Opera ion té&eache also causes 
the same data to be written in main memory,during t 





write-through cache 


Copies are kept of any data in the region; ré¢ zt 
but write operations update the actuakdata location and either update or 
invalidate all copies. 






WRITE_BLOCK 


A transaction where the 2114 
write data. 
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A 


Aborts, 2-18 
Absolute Maximum Beene, 9-1 
ac coupling, 9-5 
Addressing, 1-2 
Address regions, physical, 4—12 
Address translation, 2-10 
Alpha AXP documentation, D-2 
ALT _ MODE register, 5-60 
Architecture, 1-1 to 1-3 
Associated literature, D-2 
AST, 2-8 
ASTER register, 5-26 
ASTRR register, 5-25 
Asynchronous system tra: 
See AST 


Beache, 2-13 

block size, 4—15 
- hit under READ MESS éxample, 
interface, 4 


FIG register, 5-84 
[TROL register, 5-78 






487 






















Bus interface unit 


* See BIU 


Cc 


Cache coherency, 4-18 to 4-27 
basics, 4-19 
flush protocol, 4~—20 
flush protocol state machines, 4-26 _ 
flush protocol systems, 4-24 
transaction conflicts, 4—27 
write invalidate protocol, 4—20 
write invalidate protocol state machines, 
4-23 
write invalidate protocol states, 4-22 
write invalidate protocol systems, 4-21 
Cache control and bus interface unit 
See Chox 
Cache organization, 2-12 
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Chox, 2-12 
IPR PAL restrictions, 5—99 
IPRs, 5-68 to 5-97 
read requests, 2-30 
write buffer data store, 2—34 
write ordering, 2-36 

CC register, 5—61 

CC_CTL register, 5-62 

Clocks, 4-5 to 4-12 
CPU, 45 
reference, 4-8, 4-9 
system, 4-6 

Commands 
21164 initiated, 4-34 
BCACHE VICTIM, 4—36 
FETCH, 4-35 . 
FETCH_M, 4-35 
FLUSH, 4-62 
INVALIDATE, 4-53 
LOCK, 4-35 
MEMORY BARRIER, 4-35 
NOP, 4-35, 4-53, 4-62 
READ, 4-62 
READ DIRTY, 4-53 


READ DIRTY/INVALIDATE, 4-54 


READ MISSO, 4-36 
READ MISS1, 4-36 : 
READ MISS MODO, 4—3¢ 
READ MISS MOD1, 4-36 ° 
READ MISS MOD STCO, 4-3 
READ MISS MOD STC1, .4-37 
SET DIRTY, 4—35 
SET SHARED, 4-53 
WRITE BLOCK, 4— 
WRITE BLOCK EQCK: 
Conventions, xxii, t8:xxvii 
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D 


Data cache 

See Deache 
Data integrity, 4-89 
address and command % 
Bcache tag control parity, 





PE’ Re: STAT register, 5-50 
TEST CTL register, 5-63 
‘DE: TEST TAG register, 5-64 
DC:SEST_TAG TEMP register, 5-66 
Decoupling, 9-20 
‘Design examples, 2-39 
“Documentation, D-2 
DTB, 2-10 
DTBIAP register, 5-52 
DTBIA register, 5~-52 
DTBIS register, 5—53 
DTB_ASN register, 5—38 
DTB_CM register, 5—39 
DTB_PTE register, 5-41 
DTB_PTE_TEMP register, 5—43 
DTB_TAG register, 5—40 
Duplicate tag store, 4—15 

algorithm, 4-17 

full, 4-15 

partial, 4-18 








E 


Ebox, 2-9 

registers, 2-9, 5-98 
ECC, 4-89 to 4-91 
EI_ADDR register, 5-93 
EL STAT register, 5-90 
Entry pointer queues, 2-34 
Environment instructions 

PALcode, 6—7 
Error correction code 

See ECC 
Exceptions, 2-18 
EXC_ADDR register, 5-14 
EXC_MASK register, 5-17 
EXC_SUM register, 5—15 
External cache 

See Bcache 
External interface 

rules for use, 4~80 


External interface introduction, 4—2 to 4-£ 


- 


Features, 1-3 to 1-4 
FETCH command, 4-35, 
FETCH_M command, 
Fill, 2-31, 4-79 
after other transactions, 
FILL error, 4-92 
FILL transaction, 44: 
fill_h signal : 
using, 4-70 
FILL_SYN register, 





ry queue, 2-34 


H 


Heat sink, 10-3 
Hint bits, 2-10 
HWINT_CLR register 
HW_LD Instruction, “633: 
HW_MFPR Instruction, ™ 
HW_MTPR Instruétien, 6— 




























€PERR_STAT register, 5—13 
ICSR register, 5—20 
IC_FLUSH_CTL register, 5-13 
idle_be_h signal 

length of assertion, 4—70 

using, 4-70 
IEEE floating-point conformance, A-12 
IFAULT_VA_FORM register, 5-11 
Initialization 

role of interrupt signals, 4-93 
Input clock 

ac coupling, 9-5 

impedance levels, 9—5 

termination, 9—5 
Input clocks, 9-4 
Instruction 
decode, 2-5 
issue, 2-5 


Index-3 





Instruction cache 
See Icache 
Instruction fetch/decode unit and branch unit 
See Ibox 
Instruction issue, 1—3, 2-17 
Instructions 
classes, 2-20 
issue rules, 2—27 
latencies, 2-23, 2-24 
MB, 2-12 
slotting, 2-20, 2-21 
WMB, 2-12, 2-34 
Instruction translation buffer, 2-7 
See ITB 
Integer execution unit 
See Ebox 
Integer register.file 
See IRF 
Interface restrictions, 4—79 
Interface transactions 
21164 initiated, 4-34 to 4-50 
system initiated, 4-51 to 4-67 
Internal processor registers 
See IPRs 
Interrupts, 4-93 to 4-95 
ASTs, 2-8 
disabling, 2-9 
hardware, 2-8 
initialization, 4-93 
normal operation, 4-93 
priority level, 4-93 
software, 2-8 
Interrupt signals, 4-93 : 
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IPRs (cont'd) 





































BC_CONFIG, 5-84 
BC_CONTROL, 5-78 
BC_TAG_ADDR, 5-88 
CC, 5-61 

CC_CTL, 5-62 
DC_FLUSH, 5-60 : 
DC_MODE, 5-56 


"eter 


EEL 

HWINT_CLR, 5-28 
ICPERR_STAT, 5-13 
ICSR, 2-9, 5-20 
IC_FLUSH_CTL, 5-13 
IFAULT VA_FORM, 5-11 
INTID, 5-24 

IPL, 2-9, 5-23 

ISR, 5-29 

ITB_ASN, 5-8 

ITB IA, 5-9 
ITB_IAP, 5-9 
ITB_IS, 5-10 
ITB_PTE, 5-6 
ITB_PTE_ TEMP, 5-9 
ITB_TAG, 5-5 
IVPTBR, 5-12 

MAF MODE, 5-58 
MCSR, 5-54 
MM_STAT, 5-44 


IPRs (cont’d) 
MVPTBR, 5-49 


PAL_BASE, 5-18, 6-3 


PMCTR, 5-33 
PS, 5-19 
reset state, 7—9 
SC_ADDR, 5-75 
SC_CTL, 5-69 
SC_STAT, 5-72 
SIRR, 5-27 
SL_RCV, 5-32 
SL_XMIT, 5-31 
VA, 5-46 
VA_FORM, 5-47 

IRF, 2-9 

ISR register, 5—29 

Issue rules, 2—27 


Issuing rules, 2-19 to 2-28 


ITB, 2-7 

ITB_ASN register, 5-8 
’ ITB_IAP register, 5-9 
ITB_IA register, 5-9 

ITB_IS register, 5—10 
ITB_PTE register, 5-6 


ITB_PTE_TEMP register, 


ITB_TAG register, 5—5 
IVPTBR register, 5-12 


L 


Latencies, 2—23, 2-24 
Literature, D-2 | 
‘Live lock 

cache conflict, 44 
Load-after-store tr Pp; 
































MAF, 2-11, 2-29 to 2-32 
entries, 2-31 
entry, 2-32 
rules, 2-29 

MAF MODE register, 





MEMDRY BARRIER command, 4-35 
Memory regions, physical, 4-12 
Merge 
write buffer, 4-14 
Merging 
loads to noncacheable space, 2-30 
rules, 2-29 
- Microarchitecture, 2-2 to 2-13 
Miss address file 
See MAF 
' MM_STAT register, 5-44 
Multiple instruction issue, 2—5 
MVPTBR register, 5-49 


Noncached read operations, 4-13 
Noncached write operations, 4-14 


Nonissue conditions, 2—19 
NOP command, 4-35, 4-53, 4-62 
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0 


Operating temperature, 10-1 
Ordering products, D-1 


p 


Page table entry 

See PTE 
PAL 

restrictions, 5-100 
PALcode, 1-2 


environment instructions, 6-7 


invoke, 6-3 
PALmode, 6-2 
environment, 6—2 
PALshadow registers, 5-98 
PALtemp IPRs, 5-98 
encoding, 5-2 
PAL_BASE IPR, 6-3 
PAL_BASE register, 5-18 
Parity, 4-89 . 
Parts 
ordering, D-1 
Pending request queue, 2-34 
Performance counters, 


Physical address regions, 4-1 
Physical memory regions, 4~12 
Pipeline organization, 
Pipelines, 2-9 
bubbles, 2-19 
examples, a 
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2-13 to. 2— 


PMCTR register, 5—33 

Power supply 
considerations, 9-19 
decoupling, 9-20 

sequencing, 9-20 
































Producer—consume 
Producer—produce¥ d 


READ ‘command, 462. 


“READ DIRTY command, 4-53 
“READ DIRTY/INVALIDATE command, 4—54 





READ DIRTY/INVALIDATE transaction, 
4-56 

READ DIRTY timing diagram, 4—56 

READ DIRTY transaction, 4-56 

READ MISSO command, 4-36 

READ MISS1 command, 4-36 

READ MISS MOD0 command, 4-36 

READ MISS MOD1 command, 4-36 

READ MISS MOD STCO command, 4-37 

READ MISS MOD STC1 command, 4-37 

READ MISS no Beache timing diagram, 
4-38 

READ MISS timing diagram, 4-39 

READ MISS transaction, 4-39 


READ MISS transaction (no Beache), 4-38 
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example, 4-85 
READ MISS with victim abort example, 
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READ MISS with victim example, 4-81 
READ MISS with victim timing diagram, 
4-43, 4-44 
READ MISS with victim transaction, 441 
READ timing diagram, 4-66 
READ transaction, 4-66 
Read/write spacing 
data bus contention, 4-69 
Reference clock, 4-8, 4-9 
example 1, 4-10 
example 2, 4-11 
examples, 4-9 
Registers 
See also IPRs 
accessibility, 5-1 
integer, 2-9 
PALshadow, 2-9, 5-98 
PALtemp, 5-98 
Related documentation, D-—2 
Replay traps, 2-28 to 2-29 .. 
as aborts, 2-19 
load instruction, 2-11 2-32 2 
load-miss-and-use, 2: 
Reset 
forcing, 4-92 
Resource conflict, 2-19 
Restrictions 
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See SROM 

SET DIRTY command, 4-35 
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Technical support, D-1 
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Thermal resistance, 10-1 
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Timing diagrams 
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READ, 4-66 
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Timing diagrams (cont'd) 
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SET DIRTY, 448 
SET SHARED, 4-60 
WRITE BLOCK, 4-47 
Transactions 
FILL, 441 
FLUSH, 4-64 
INVALIDATE, 4-58 
LOCK, 448 
- READ, 4-66 
READ DIRTY, 4-56 
READ DIRTY/INVALIDATE, 4-56 
READ MISS, 4-39 
READ MISS (no Beache), 4-38 
READ MISS with victim, 4-41 
SET DIRTY, 4-48 
SET SHARED, 4-60 
system initiated, 4-51 
WRITE BLOCK, 4-46 
WRITE BLOCK LOCK, 4-46 
Traps 
load-after-store, 2-28 
load-miss-and-use, 2-27 
replay, 2-19, 2-28, 2-32 
Tristate 
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READ or WRITE to fill, 4-73 
system Bceache command to fill 47 : 





























V 


VA register, 5-46 
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Wave pipeli e, 431 


WRITE” B BLOCK LOCK transaction, 4—46 
WRITE BLOCK timing diagram, 4~47 
WR TE BLOCK transaction, 4—46 

Write buffer, 2-12, 2-34 to 2-36 
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